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NOVEL STREPTOCOCCUS ANTIGENS 
FIELD OF THE INVENTION 

The present invention is related to antigens, more 
particularly protein antigens of streptococcus 
pneumoniaepathogen which are useful as vaccine components 
for therapy and/or prophylaxis. 

BACKGROUND OF THE INVENTION 

pneumoniae is an important agent of disease in man 
especially among infants, the elderly and immunocompromised 
persons. It is a bacterium frequently isolated from 
patients with invasive diseases such as 

bacteraemia/ septicaemia, pneumonia, meningitis with high 
morbidity and mortality throughout the world. Even with 
appropriate antibiotic therapy, pneumococcal infections 
still result in many deaths. Although the advent of 
antimicrobial drugs has reduced the overall mortality from 
pneumococcal disease, the presence of resistant pneumococcal 
organisms has become a major problem in the world today. 
Effective pneumococcal vaccines could have a major iirpact on 
the morbidity and mortality associated with pneumoniae 
disease. Such vaccines would also potentially be useful to 
prevent otitis media in infants and young children. 

Efforts to develop a pneumococcal vaccine have generally 
concentrated on generating immune responses to the 
pneumococcal capsular polysaccharide. More than 80 
pneumococcal capsular serotypes have been identified on the 
basis of antigenic differences. The currently available 
pneumococcal vaccine, comprising 23 capsular polysaccharides 
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that most frequently caused disease, has significant 
shortcomings related primarily to the poor immunogenicity of 
some capsular polysaccharides, the diversity of the 
serotypes and the differences in the distribution of 
5 serotypes over time, geographic areas and age groups. In 
particular, the failure of existing vaccines and capsular 
conjugate vaccines currently in development to protect young 
children against all serotypes spurres evaluation of other 
pneumoniae components. Although immunogenicity of 
10 capsular polysaccharides can be improved, serotype 

specificity will still represent a major limitation of 
polysaccharide-based vaccines. The use of a antigenically 
conserved immunogenic pneumococcal protein antigen, either 
by itself or in combination with additional components, 
15 offers the possibility of a protein-based pneumococcal 
vaccine . 



20 



25 



30 



PCT Publication number WO98/18930 published may 7 1998 
entitled "Streptococcus Pneumoniae antigens and vaccines" 
describes certain polypeptides which are claimed to be 
antigenic. However, no biological activity of these 
polypeptides is reported. 

Therefore their remains an unmet need for Streptococcus 
antigens that may be used as vaccine components for the 
prophylaxis and/or therapy of Streptococcus infection. 

SUMMARY QP THE INVENTION 

According to one aspect, the present invention provides an 
isolated polynucleotide encoding a polypeptide having at 
least 70% identity to a second polypeptide comprising a 
sequence chosen from SEQ ID NOs: 2. 4, 6, 8, 10 # 14, 16, 55 
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to 75, 77 to 79, 81, 83 or fragments, analogs or 
derivatives thereof . 

In other aspects, there are provided vectors comprising 
5 polynucleotides of the invention operably linked to an 
expression control region, as well as host cells 
transfected with said vectors and methods of producing 
polypeptides comprising culturing said host cells under 
conditions suitable for expression. 

10 

In yet another aspect, there are provided novel 
polypeptides encoded by polynucleotides of the invention. 



15 BRIEF DESCRIPTION OF THE DRAWINGS 

Figure 1 is the DNA sequence of BVH-3 gene; SEQ ID NO: 1. 

Figure 2 is the amino acid sequence of BVH-3 protein; SEQ ID 
20 NO: 2. 

Figure 3 is the DNA sequence of BVH-11 gene; SEQ ID NO: 3. 

Figure 4 is the amino acid sequence of BVH-11 protein; SEQ 
25 ID NO: 4. 

Figure 5 is the DNA sequence of BVH-28 gene; SEQ ID NO: 5. 

Figure 6 is the amino acid sequence of BVH-28 protein; SEQ 

30 ID NO: 6. 

i 

Figure 7 is the DNA sequence of BVH-3 A gene which 
corresponds to the 5' terminal end of BVH-3; SEQ ID NOi 7. 
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Figure 8 is the amino acid sequence of BVH-3A protein; SEQ 
ZD NO: 8. 

Figure 9 is the DNA sequence of BVH-3B gene which 
5 corresponds to the 3' terminal end of BVH-3; SEQ ID NO: 9. 

Figure 10 is the amino acid sequence of BVH-3B protein; SEQ 
ID NO» 10. 

10 

Figure 11 depicts the comparison of the predicted amino 
acid sequences of the BVH-3 open reading frames from WU2, 
RX1, JNR.7/87, SP64, P4241 and A66 pneumoniae strains 
by using the program Clustal W from MacVector sequence 
15 analysis software (version 6.5). Underneath the alignment, 
there is a consensus line where * and . characters indicate 
identical and similar amino acid residues, respectively. 



20 



25 



Figure 12 depicts the comparison of the predicted amino 
acid sequences of the BVH-11 open reading frames from WU2, 
Rxl, JNR.7/87, SP64, P4241, A66 and SP63 S, pneumoniae 
strains by using the program Clustal W from MacVector 
sequence analysis software (version 6.5). Underneath the 
alignment, there is a consensus line where * and 
characters indicate identical and similar amino acid 
residues, respectively. 



Figure 13 depicts the comparison of the predicted amino 
acid sequences of the BVH-11 proteins from various S^. 
30 pneumoniae strains. The degrees of identity (I) and 

similarity (S) were determined by using the program Clustal 
W from MacVector sequence analysis software (version 6.5). 

Figure 14 is a DNA sequence containing the complete BVH-3 
35 gene (open reading frame "ORF" at nucleotides 1777 to 
4896) ; SEQ ID NO* 11. 
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Figure 15 is a DNA sequence containing the complete BVH-11 
gene (ORF at nucleotides 45 to 2567); SEQ ID NO: 12. 

5 Figure 16 is a DNA sequence containing the complete BVH-11- 
2 gene (ORF at nucleotides 114 to 2630); SEQ id NO: 13. 

Figure 17 is the amino acid sequence of BVH-11-2 protein; 
SEQ ID NO: 14. 

10 

Figure 18 is the DNA sequence of SP63 BVH-3 gene; SEQ id 
NO: 15. 



Figure 19 is the amino acid sequence of SP63 BVH-3 protein; 
15 SEQ ID NO: 16. 

Figure 20 is the amino acid sequence of BVH-3M protein; SEQ 
ID NO: 55. 

20 Figure 21 is the amino acid sequence of BVH-3 AD protein; 
SEQ ID NO: 56. 

Figure 22 is the amino acid sequence of L-BVH-3-AD protein; 
SEQ ID NO: 57. 

25 

Figure 23 is the amino acid sequence of NEW12 protein; SEQ 
ID NO: 58. 



Figure 24 is the amino acid sequence of BVH-3C protein; SEQ 
30 ID NO: 59. 



Figure 25 is the amino acid sequence of BVH-11M protein; 
SEQ ID NO: 60. 

35 Figure 26 is the amino acid sequence of BVH-11A protein; 
SEQ ID NO: 61. 
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Figure 27 is the amino acid sequence of BVH-11B (also 
called Newl3) protein; SEQ ID NO: 62. 

5 Figure 28 is the amino acid sequence of BVH-11C protein; 
SEQ ID 130 : 63. 

Figure 29 is the amino acid sequence of NEW1 protein; SEQ 
ID NO: 64. 

10 

Figure 30 is the amino acid sequence of NEW2 protein; SEQ 
ID NO: 65. 



Figure 31 is the amino acid sequence of NEW3 protein; SEQ 
15 ID NO: 66. 



Figure 32 is the amino acid sequence of NEW4 protein; SEQ 
ID NO: 67. 

20 Figure 33 is the amino acid sequence of NEW5 protein; SEQ 
ID NO: 68. 

Figure 34 is the amino acid sequence of NEW6 protein; SEQ 
ID NO: 69. 

25 

Figure 35 is the amino acid sequence of NEW7 protein; SEQ 
ID NO» 70. 



Figure 36 is the amino acid sequence of NEW8 protein; SEQ 
30 ID NO: 71. 



Figure 37 is the amino acid sequence of NEW9 protein; SEQ 
ID NO: 72. 

35 Figure 38 is the amino acid sequence of BVH-11-2M protein; 
SEQ ID NO: 73. 
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Figure 39 is the amino acid sequence of NEW10 protein; SEQ 
ID NO: 74. 

Figure 40 is the amino acid sequence of NEW11 protein; SEQ 
ID NO: 75. 

Figure 41 is the DNA sequence of NEW12 gene; SEQ ID NO: 76. 

Figure 42 is the amino acid sequence of NEW14 protein; SEQ 
ID NO: 77. 



Figure 43 is the amino acid sequence of NEW15 protein; SEQ 
ID NO: 78. 

15 

Figure 44 is the amino acid sequence of NEW16 protein; SEQ 
ID NO: 79. 

Figure 45 is the DNA sequence of GBS BVH-71 gene; SEQ ID 
20 NO: 80. 

Figure 46 is the amino acid sequence of GBS BVH-71 
protein; SEQ ID NO: 81. 

25 Figure 47 is the DNA sequence of GAS BVH-71 gene; SEQ ID 
NO:82. 

Figure 48 is the amino acid sequence of GAS BVH-71 protein; 
SEQ ID NO: 83. 

30 

DETAILED DESCRIPTION OF THE INVENTION 

i 

According to one aspect, the present invention provides an 
35 isolated polynucleotide encoding a polypeptide having at 
least 70% identity to a second polypeptide comprising a 
sequence chosen from SEQ ID NOst 2, 4, 6, 8, 10, 14, 16, 55 

7 
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to 75, 77 to 79, 81, 83 or fragments, analogs or 
derivatives thereof. 

According to one aspect, the present invention provides an 
isolated polynucleotide encoding a polypeptide having at 
least 95% identity to a second polypeptide comprising a 
sequence chosen from SEQ ID NOs: 2, 4, 6, 8, 10, 14, 16, 55 
to 75, 77 to 79, 81, 83 or fragments, analogs or 
derivatives thereof. 

According to one aspect, the present invention provides an 
isolated polynucleotide encoding a polypeptide having at 
least 70% identity to a second polypeptide comprising a 
sequence chosen from SEQ ID NOs: 2, 4, 8, 10, 14, 16, 55 to 
75, 77 to 79, 81, 83 or fragments, analogs or derivatives 
thereof . 

According to one aspect, the present invention provides an 
isolated polynucleotide encoding a polypeptide having at 
least 70% identity to a second polypeptide comprising a 
sequence chosen from SEQ ID NOs: 2, 4, 10, 14, 16, 55 to 
75, 77 to 79, 81, 83 or fragments, analogs or derivatives 
thereof . 

According to one aspect, the present invention provides an 
isolated polynucleotide encoding a polypeptide having at 
least 70% identity to a second polypeptide comprising a 
sequence chosen from SEQ ID NOs: 2, 4, 8, 10, 14, 16, 55 to 
75, 77 to 79 or fragments, analogs or derivatives thereof. 

According to one aspect, the present invention provides an 
isolated polynucleotide encoding a polypeptide having at 
least 70% identity to a second polypeptide comprising a 
sequence chosen from SEQ ID NOs : 2, 8, 10, 16, 55, 56, 57, 
58, 59, 64, 65, 66, 78 or fragments, analogs or derivatives 
thereof . 
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According to one aspect, the present invention provides an 
isolated polynucleotide encoding a polypeptide having at 
least 70% identity to a second polypeptide comprising a 
sequence chosen from SEQ ID NOs: 2, 8, 10, 16, 55, 56, 57, 
59, 64, 65/ 66, 78 or fragments, analogs or derivatives 
thereof. 

According to one aspect, the present invention provides an 
isolated polynucleotide encoding a polypeptide having at 
least 70% identity to a second polypeptide comprising a 
sequence chosen from SEQ ID NOs: 4, 14, 58, 60, 61, 62, 63, 

67, 68, 69, 70, 71, 72, 73, 74, 75, 77, 79 or fragments, 
analogs or derivatives thereof. 

According to one aspect, the present invention provides an 
isolated polynucleotide encoding a polypeptide having at 
least 70% identity to a second polypeptide comprising a 
sequence chosen from SEQ ID NOs: 4, 14, 60, 61, 62, 63, 67, 

68, 69, 70, 71, 72, 73, 74, 75, 77, 79 or fragments, 
analogs or derivatives thereof. 

According to one aspect, the present invention provides an 
isolated polynucleotide encoding a polypeptide having at 
least 70% identity to a second polypeptide comprising a 
sequence chosen from SEQ ID NOs: 2, 4, 10, 14, 16, 55 to 
75, 77 to 79 or fragments, analogs or derivatives thereof. 

According to one aspect, the present invention provides an 
isolated polynucleotide encoding a polypeptide having at 
least; 70% identity to a second polypeptide comprising 
sequence chosen from SEQ ID NOs: 10, 55 to 75, 77, 78, 79 
or fragments, analogs or derivatives thereof. 

According to one aspect, the present invention provides an 
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isolated polynucleotide encoding a polypeptide having at 
least 70% identity to a second polypeptide comprising 
sequence chosen from SEQ ID NOs: 55 to 75, 77, 78, 79 or 
fragments, analogs or derivatives thereof. 

According to one aspect, the present invention provides an 
isolated polynucleotide encoding a polypeptide having at 
least 70% identity to a second polypeptide comprising a 
sequence chosen from SEQ ID NOs: 2, 4, 6, 8, 10 or 
fragments, analogs or derivatives thereof. 

According to one aspect, the present invention provides an 
isolated polynucleotide encoding a polypeptide having at 
least 70% identity to a second polypeptide comprising a 
sequence chosen from SEQ ID NOs: 2, 4, 10, 14, 16 or 
fragments, analogs or derivatives thereof. 

According to one aspect, the present invention provides an 
isolated polynucleotide encoding a polypeptide having at 
least 70% identity to a second polypeptide comprising a 
sequence chosen from SEQ ID NOs: 2, 4, 14, 16 or fragments, 
analogs or derivatives thereof. 

According to one aspect, the present invention provides an 
isolated polynucleotide encoding a polypeptide having at 
least 70% identity to a second polypeptide comprising 
sequence SEQ ID NO: 2 or fragments, analogs or derivatives 
thereof . 

According to one aspect, the present invention provides an 
isolated polynucleotide encoding a polypeptide having at 
least 70% identity to a second polypeptide comprising 
sequence SEQ ID NO: 4 or fragments, analogs or derivatives 
thereof. 
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According to one aspect, the present invention provides an 
isolated polynucleotide encoding a polypeptide having at 
least 70% identity to a second polypeptide comprising 
sequence SEQ ID NO: 10 or fragments, analogs or derivatives 
thereof. 

According to one aspect, the present invention provides an 
isolated polynucleotide encoding a polypeptide having at 
least 70% identity to a second polypeptide comprising 
sequence SEQ ID NO: 14 or fragments, analogs or derivatives 
thereof . 

According to one aspect, the present invention provides an 
isolated polynucleotide encoding a polypeptide having at 
least 70% identity to a second polypeptide comprising 
sequence SEQ ID NO: 16 or fragments, analogs or derivatives 
thereof . 

According to one aspect, the present invention provides an 
isolated polynucleotide encoding a polypeptide having at 
least 70% identity to a second polypeptide comprising 
sequence SEQ ID NO: 58 or fragments, analogs or derivatives 
thereof . 

According to one aspect, the present invention provides an 
isolated polynucleotide encoding a polypeptide having at 
least 70% identity to a second polypeptide comprising 
sequence SEQ ID NO: 60 or fragments, analogs or derivatives 
thereof . 

According to one aspect, the present invention provides an 
isolated polynucleotide encoding a polypeptide having at 
least 70% identity to a second polypeptide comprising 
sequence SEQ ID NO: 62 or fragments, analogs or derivatives 
thereof. 
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According to one aspect, the present invention provides an 
isolated polynucleotide encoding a polypeptide having at 
least 70% identity to a second polypeptide comprising 
sequence SEQ ID NO: 64 or fragments, analogs or derivatives 
5 thereof . 

According to one aspect, the present invention provides an 
isolated polynucleotide encoding a polypeptide having at 
least 70% identity to a second polypeptide comprising 
10 sequence SEQ ID NO: 67 or fragments, analogs or derivatives 
thereof . 

According to one aspect, the present invention provides an 
isolated polynucleotide encoding a polypeptide having at 
15 least 70% identity to a second polypeptide comprising 

sequence SEQ ID NO: 68 or fragments, analogs or derivatives 
thereof . 

According to one aspect, the present invention provides an 
20 isolated polynucleotide encoding a polypeptide having at 
least 70% identity to a second polypeptide comprising 
sequence SEQ ID NO: 69 or fragments, analogs or derivatives 
thereof . 

25 According to one aspect, the present invention provides an 
isolated polynucleotide encoding a polypeptide having at 
least 70% identity to a second polypeptide comprising 
sequence SEQ ID NO: 72 or fragments, analogs or derivatives 
thereof . 

30 

According to one aspect, the present invention provides an 
isolated polynucleotide encoding a polypeptide having at 
least 70% identity to a second polypeptide comprising 
sequence SEQ ID NO: 74 or fragments, analogs or derivatives 
35 thereof . 
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According to one aspect, the present invention provides an 
isolated polynucleotide encoding a polypeptide having at 
least 70% identity to a second polypeptide comprising 
sequence SEQ ID NO: 77 or fragments, analogs or derivatives 
5 thereof . 

According to one aspect, the present invention relates to 
polypeptides characterized by the amino acid sequence 
chosen from SEQ ID NOs: 2, 4, 6, 8, 10 or fragments, 
10 analogs or derivatives thereof. 

According to one aspect, the present invention relates to 
polypeptides characterized by the amino acid sequence 
chosen from SEQ ID NOs: 2, 4, 6, 8, 10, 14, 16, 55 to 75, 
15 77 to 79, 81, 83 or fragments, analogs or derivatives 
thereof . 

According to one aspect, the present invention relates to 
polypeptides characterized by the amino acid sequence 
20 chosen from SEQ ID NOs: 2, 4, 8, 10, 14, 16, 55 to 75, 77 
to 79, 81, 83 or fragments, analogs or derivatives thereof. 

According to one aspect, the present invention relates to 
polypeptides characterized by the amino acid sequence 
25 chosen from SEQ ID NOs: 2, 4 , 10, 14, 16, 55 to 75, 77 to 
79, 81, 83 or fragments, analogs or derivatives thereof. 

According to one aspect, the present invention relates to 
polypeptides characterized by the amino acid sequence 
30 chosen from SEQ ID NOs: 2, 4, 8, 10, 14, 16, 55 to 75, 77 
to 79 or fragments, analogs or derivatives thereof. 

According to one aspect, the present invention relates to 
polypeptides characterized by the amino acid sequence 
35 chosen from SEQ ID NOs: 2, 4, 10, 14, 16, 55 to 75, 77 to 
79 or fragments, analogs or derivatives thereof. 

13 
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According to one aspect, the present invention relates to 
polypeptides characterized by the amino acid sequence 
chosen from SEQ ID NOs: 2, 4, 10, 14, 16 or fragments, 
5 analogs or derivatives thereof. 

According to one aspect, the present invention relates to 
polypeptides characterized by the amino acid sequence 
comprising sequence SEQ ID NO* 2 or fragments, analogs or 
10 derivatives thereof. 

According to one aspect, the present invention relates to 
polypeptides characterized by the amino acid sequence 
comprising sequence SEQ ID NO: 4 or fragments, analogs or 
15 derivatives thereof. 



According to one aspect, the present invention relates to . 
polypeptides characterized by the amino acid sequence 
comprising sequence SEQ ID NO. 10 or fragments, analogs or 
20 derivatives thereof. 

According to one aspect, the present invention relates to 
polypeptides characterized by the amino acid sequence 
comprising sequence SEQ ID NOs 14 or fragments, analogs or 
25 derivatives thereof. 

According to one aspect, the present invention relates to 
polypeptides characterized by the amino acid sequence 
comprising sequence SEQ ID NOi 16 or fragments, analogs or 
derivatives thereof. 

According to one aspect, the present invention relates to 
polypeptides characterized by the amino acid sequence 
chosen from SEQ ID NOs* 10, 55 to 75, 77, 78, 79 or 
fragments, analogs or derivatives thereof. 
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According to one aspect, the present invention relates to 
polypeptides characterized by the amino acid sequence 
chosen from SEQ ID NO: 10, 58, 60, 62, 64, 67, 68, 69, 72, 
74, 77 or fragments, analogs or derivatives thereof. 

5 

According to one aspect, the present invention relates to 
polypeptides characterized by the amino acid sequence 
chosen from SEQ ID NO: 10, 58, 60, 62, 64, 67, 68, 69, 72, 
74, 77 or fragments, analogs or derivatives thereof. 

10 

According to one aspect, the present invention relates to 
polypeptides characterized by the amino acid sequence 
chosen from SEQ ID NO: 10, 58, 60, 62, 64, 67, 68, 69, 72, 
74, 77 or fragments, analogs or derivatives thereof. 

15 

According to one aspect, the present invention relates to 
polypeptides characterized by the amino acid sequence 
chosen from SEQ ID NO: 10, 62, 64, 67, 68, 74, 77 or 
fragments, analogs or derivatives thereof. 

20 

According to one aspect, the present invention relates to 
polypeptides characterized by the amino acid sequence 
comprising sequence SEQ ID NO: 58 or fragments, analogs or 
derivatives thereof . 

25 

According to one aspect, the present invention relates to 
polypeptides characterized by the amino acid sequence 
comprising sequence SEQ ID NO: 62 or fragments, analogs or 
derivatives thereof. 

30 

According to one aspect, the present invention relates to 
polypeptides characterized by the amino acid sequence 
comprising sequence SEQ ID NO: 64 or fragments, analogs or 
derivatives thereof. 

35 

According to one aspect, the present invention relates to 

15 
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polypeptides characterized by the amino acid sequence 
comprising sequence SEQ ID NO: 67 or fragments, analogs o 
derivatives thereof. 

According to one aspect, the present invention relates to 
polypeptides characterized by the amino acid sequence 
comprising sequence SEQ ID NO: 68 or fragments, analogs o: 
derivatives thereof. 

According to one aspect, the present invention relates to 
polypeptides characterized by the amino acid sequence 
comprising sequence SEQ ID NO: 74 or fragments, analogs oj 
derivatives thereof. 
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According to one aspect, the present invention relates to 
polypeptides characterized by the amino acid sequence 
comprising sequence SEQ ID NO: 77 or fragments, analogs or 
derivatives thereof. 

In a further embodiment, the present invention also relates 
to chimeric polypeptides which comprise one or more 
polypeptides or fragments, analogs or derivatives thereof 
as described in the present application. 

In a further embodiment, the present invention also relates 
to chimeric polypeptides which comprise one or more 
polypeptides or fragments, analogs or derivatives thereof 
as defined in the figures of the present application. 

In a further embodiment, the present application also 
relates to chimeric polypeptides which comprise two or more 
polypeptides chosen from SEQ ID NOs: 2, 4, 6, 8, 10, 14, 
16, 55 to 75, 77 to 79, 81, 83 or fragments, analogs or 
derivatives thereof /provided that the polypeptides or 
fragments, analogs or derivatives thereof are linked as to 
form a chimeric polypeptide. 
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In a further embodiment, the chimeric polypeptide will 
comprise two or more polypeptides chosen from SEQ ID 
NOs :10, 58, 60, 62, 64, 67, 68, 69, 72, 74, 77 or 

fragments, analogs or derivatives thereof; provided that 
the polypeptides or fragments, analogs or derivatives 
thereof are linked as to form a chimeric polypeptide. 

In a further embodiment, the chimeric polypeptide will 
comprise two or more polypeptides chosen from SEQ ID 
NOs :10, 58, 60, 62, 64, 67, 68, 74, 77 or fragments, 
analogs or derivatives thereof; provided that the 
polypeptides or fragments, analogs or derivatives thereof 
are linked as to form a chimeric polypeptide. 

In a further embodiment, the chimeric polypeptide will 
comprise two or more polypeptides chosen from SEQ ID 
NOs HO, 62, 64, 67, 68, 74, 77 or fragments, analogs or 
derivatives thereof; provided that the polypeptides or 
fragments, analogs or derivatives thereof are linked as to 
form a chimeric polypeptide. 

In a further embodiment, the chimeric polypeptide will 
comprise between 2 and 5 polypeptides. 

In a further embodiment, the chimeric polypeptide will 
comprise between 2 and 4 polypeptides. 

In a further embodiment, the chimeric polypeptide will 
comprise between 2 and 3 polypeptides. 

In a further embodiment, the chimeric polypeptide will 
comprise 2 polypeptides. 
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In a further embodiment, there is provided a chimeric 
polypeptide of formula (I) : 
A-(B) m -(C) a -D (I) 

5 Wherein; 

i is 0 or 1, 
n is 0 or 1, 

A is chosen from SEQ ID NOs: 2, 4, 6, 8, 10, 14, 16, 55 to 

75, 77 to 79, 81, 83 or fragments, analogs or derivatives 
10 thereof; 

B is chosen from SEQ ID NOs: 2, 4, 6, 8, 10, 14, 16, 55 to 

75, 77 to 79, 81, 83 or fragments, analogs or derivatives 
thereof ; 

C is chosen from SEQ ID NOs: 2, 4, 6, 8, 10, 14, 16, 55 to 
75, 77 to 79, 81, 83 or fragments, analogs or derivatives 
thereof ; and 

D is chosen from SEQ ID NOs: 2, 4, 6, 8, 10, 14, 16, 55 to 

75, 77 to 79, 81, 83 or fragments, analogs or derivatives 
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thereof . 



In a further embodiment, 

A is chosen from SEQ ID NOs :10 # 58, 60, 62, 64, 67, 68, 

69, 72, 74, 77 or fragments, analogs or derivatives 
thereof; . 

B is chosen from SEQ ID NOs ilO, 58, 60, 62, 64, 67, 68, 

69, 72, 74, 77, or fragments, analogs or derivatives 
thereof ; 

C is chosen from SEQ ID NOs :10, 58, 60, 62, 64, 67, 68, 

69, 72, 74, 77 or fragments, analogs or derivatives 
>0 thereof; and 

D is chosen from SEQ ID NOs :10, 58, 60, 62, 64, 67, 68, 
69, 72, 74, 77 or fragments, analogs or derivatives 
thereof . 

In a further embodiment, 
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A is chosen from SEQ ID NOs :10, 58, 60, 62, 64, 67, 68, 

74, 77 or fragments, analogs or derivatives thereof; 
B is chosen from SEQ ID NOs :10, 58, 60, 62, 64, 67, 68, 
74, 77, or fragments, analogs or derivatives thereof; 
C is chosen from SEQ ID NOs :10, 58, 60, 62, 64, 67, 68, 
74, 77 or fragments, analogs or derivatives thereof; and 
D is chosen from SEQ ID NOs :10, 58, 60, 62, 64, 67, 68, 
74, 77 or fragments, analogs or derivatives thereof. 

In one embodiment, chimeric polypeptides of the present 
invention comprise those wherein the following embodiments 
are present, either independently or in combination. 

In a further embodiment, A is SEQ ID NOs :10, 58, 62, 64, 
67, 68, 74, 77 or fragments, analogs or derivatives 
thereof . 

In a further embodiment, A is SEQ ID NO :10 or fragments, 
analogs or derivatives thereof. 

In a further embodiment, A is SEQ ID NO :58 or fragments, 
analogs or derivatives thereof. 

In a further embodiment, A is SEQ ID NO :62 or fragments, 
analogs or derivatives thereof. 

In a further embodiment, A is SEQ ID NO :64 or fragments, 
analogs or derivatives thereof. 

In a further embodiment, A is SEQ ID NO «67 or fragments, 
analogs or derivatives thereof. 

In a further embodiment, A is SEQ ID NO :68 or fragments, 
analogs or derivatives thereof. 

In a further embodiment, A is SEQ ID NO :74 or fragments, 
analogs or derivatives thereof. 

In a further embodiment, A is SEQ ID NO i77 or fragments, 
analogs or derivatives thereof. 
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In a further embodiment, B is SEQ ID NOs :10, 58, 62 , 64, 
67, 68, 74, 77 or fragments, analogs or derivatives 
thereof • 

In a further embodiment, B is SEQ ID NO :10 or fragments, 
analogs or derivatives thereof. 

In a further embodiment, B is SEQ ID NO :58 or fragments, 
analogs or derivatives thereof. 

In a further embodiment, B is SEQ ID NO :64 or fragments, 
analogs or derivatives thereof. 

In a further embodiment, B is SEQ ID NO :64 or fragments, 
analogs or derivatives thereof. 

In a further embodiment, B is SEQ ID NO :67 or fragments, 
analogs or derivatives thereof . 

In a further embodiment, B is SEQ ID NO :68 or fragments, 
15 analogs or derivatives thereof . 

In a further embodiment, B is SEQ ID NO :74 or fragments, 
analogs or derivatives thereof. 

In a further embodiment, B is SEQ ID NO : 77 or fragments, 
analogs or derivatives thereof. 

20 

In a further embodiment, C is SEQ ID NOs :10, 58, 62, 64, 
67, 68, 74, 77 or fragments, analogs or derivatives 
thereof . 

In a further embodiment, C is SEQ ID NO ilO or fragments, 
25 analogs or derivatives thereof. 

In a further embodiment, C is SEQ ID NO :58 or fragments, 
analogs or derivatives thereof. 

In a further embodiment, C is SEQ ID NO t 62 or fragments, 
analogs or derivatives thereof. 

In a further embodiment, C is SEQ ID NO :64 or fragments, 
analogs or derivatives thereof. 

In a further embodiment, C is SEQ ID NO t 67 or fragments, 
analogs or derivatives thereof. 

In a further embodiment, C is SEQ ID NO : 68 or fragments, 

20 
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analogs or derivatives thereof. 

In a further embodiment, C is SEQ ID NO : 74 or fragments, 
analogs or derivatives thereof. 

In a further embodiment, C is SEQ ID NO : 77 or fragments, 
analogs or derivatives thereof. 

In a further embodiment, D is SEQ ID NO :10, 58, 62, 64, 
67, 68, 74, 77 or fragments, analogs or derivatives 
thereof . 

In a further embodiment, D is SEQ ID NO ilO or fragments, 
analogs or derivatives thereof. 

In a further embodiment, D is SEQ ID NO :58 or fragments, 
analogs or derivatives thereof. 

In a further embodiment, D is SEQ ID NO :62 or fragments, 
analogs or derivatives thereof . 

In a further embodiment, D is SEQ ID NO i64 or fragments, 
analogs or derivatives thereof. 

In a further embodiment, D is SEQ ID NO »67 or fragments, 
analogs or derivatives thereof. 

In a further embodiment, D is SEQ ID NO :68 or fragments, 
analogs or derivatives thereof. 

In a further embodiment, D is SEQ ID NO :74 or fragments, 
analogs or derivatives thereof. 

In a further embodiment, D is SEQ ID NO :77 or fragments, 
analogs or derivatives thereof. 

In a further embodiment, m is 0. 
In a further embodiment, n is 0. 

In a 'further embodiment, m and n are 0. 

In a further embodiment, xa and n are 0, A is SEQ ID NO: 64 
or fragments, analogs or derivatives thereof, B is SEQ ID 
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NO: 62 or fragments, analogs or derivatives thereof. 
In a further embodiment, m and n are 0, A is SEQ ID NO: 62 
or fragments, analogs or derivatives thereof, B is SEQ ID 
NO: 64 or fragments, analogs or derivatives thereof. 

In accordance with the present invention, all nucleotides 
encoding polypeptides and chimeric polypeptides are within 
the scope of the present invention. 

In a further embodiment, the polypeptides or chimeric 
polypeptides in accordance with the present invention are 
antigenic . 

In a further embodiment, the polypeptides or chimeric 
polypeptides in accordance with the present invention can 
elicit an immune response in an individual. 

In a further embodiment, the present invention also relates 
to polypeptides which are able to raise antibodies having 
binding specificity to the polypeptides or chimeric 
polypeptides of the present invention as defined above. 

An antibody that " has binding specificity" is an antibody 
that recognizes and binds the selected polypeptide but 
which does not substantially recognize and bind other 
molecules in a sample, e.g., a biological sample, which 
naturally includes the selected peptide. Specific binding 
can be measured using an ELISA assay in which the selected 
polypeptide is used as an antigen. 

Unless otherwise defined, all technical and scientific 
term§ used herein have the same meaning as commonly 
understood by one of ordinary skill in the art to which 
this invention belongs. All publications, patent 
applications, patents, and other references mentioned 
herein are incorporated by reference in their entirety. In 
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case of conflict, the present specification, including 
definitions, will control. In addition, the materials, 
methods, and examples are illustrative only and not 
intended to be limiting. 

As used herein, "fragments", "derivatives" or "analogs" of 
the polypeptides of the invention include those 
polypeptides in which one or more of the amino acid 
residues are substituted with a conserved or non-conserved 
amino acid residue (preferably conserved) and which may be 
natural or unnatural, in one embodiment, derivatives and 
analogs of polypeptides of the invention will have about 
70% identity with those sequences illustrated in the 
figures or fragments thereof. That is, 70% of the residues 
are the same. In a further embodiment, polypeptides will 
have greater than 75% homology, m a further embodiment, 
polypeptides will have greater than 80% homology. In a 
further embodiment, polypeptides will have greater than 85% 
homology. In a further embodiment, polypeptides will have 
greater than 90% homology, in a further embodiment, 
polypeptides will have greater than 95% homology, in a 
further embodiment, polypeptides will have greater than 99% 
homology, in a further embodiment, derivatives and analogs 
of polypeptides of the invention will have fewer than about 
20 amino acid residue substitutions, modifications or 
deletions and more preferably less than 10. Preferred 
substitutions are those known in the art as conserved i.e. 
the substituted residues share physical or chemical 
properties such as hydrophobic ity, size, charge or 
30 functional groups. 

In accordance with the present invention, polypeptides of 
the invention include both polypeptides and chimeric 
polypeptides . 

Also included are polypeptides which have fused thereto 
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other compounds which alter the polypeptides biological or 
pharmacological properties i.e. polyethylene glycol (PEG) 
to increase half -life; leader or secretory amino acid 
sequences for ease of purification; prepro- and pro- 
sequences; and (poly) saccnarides. 

Furthermore, in those situations where amino acid regions 
are found to be polymorphic, it may be desirable to vary 
one or more particular amino acids to more effectively 
mimic the different epitopes of the different streptococcus 
strains . 

Moreover, the polypeptides of the present invention can be 
modified by terminal -nh, acylation (eg. by acetylation, or 
thioglycolic acid amidation, terminal carbosy amidation, 
e.g. with ammonia or methylamine) to provide stability, 
increased hydrophobicity for linking or binding to a 
support or other molecule. 

Also contemplated are hetero and homo polypeptide multimers 
of the polypeptide fragments, analogues and derivatives. 
These polymeric forms include, for example, one or more 
polypeptides that have been cross-linked with cross-linkers 
such as avi din/bio tin, gluteraldehyde or dimethyl - 
superimidate . Such polymeric forms also include 
polypeptides containing two or more tandem or inverted 
contiguous sequences, produced from multicistronic mRNAs 
generated by recombinant DNA technology. 
Preferably, a fragment, analog or derivative of a 
polypeptide of the invention will comprise at least one 
antigenic region i.e. at least one epitope. 

In order to achieve the formation of antigenic polymers 
(i.e. synthetic multimers), polypeptides may be utilized 
having bishaloacetyl groups, nitroarylhalides , or the like, 
where the reagents being specific for thio groups. 
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Therefore, the link between two mercapto groups of the 
different peptides may be a single bond or may be composed 
of a linking group of at least two, typically at least 
four, and not more than 16, but usually not more than about 
5 14 carbon atoms. 



10 



In a particular embodiment, polypeptide fragments, analogs 
and derivatives of the invention do not contain a 
methionine (Met) starting residue. Preferably, 
polypeptides will not incorporate a leader or secretory 
sequence (signal sequence) . The signal portion of a 
polypeptide of the invention may be determined according to 
established molecular biological techniques, in general, 
the polypeptide of interest may be isolated from a 
15 streptococcus culture and subsequently sequenced to 

determine the initial residue of the mature protein and 
therefore the sequence of the mature polypeptide. 
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According to another aspect, there are provided vaccine 
compositions comprising one or more streptococcus 
polypeptides of the invention in admixture with a 
pharmaceutical^ acceptable carrier diluent or adjuvant. 
Suitable adjuvants include oils i.e. Freund's complete or 
incomplete adjuvant; salts i.e. A1K(S0 4 ) 2 , AlNa(S0 4 ),, 
AlNH 4 (S0 4 ) a , silica, kaolin, carbon polynucleotides i.e. 
poly IC and poly AU. Preferred adjuvants include QuilA and 
Alhydrogel. Vaccines of . the invention may be administered 
parenterally by injection, rapid infusion, nasopharyngeal 
absorption, dermoabsorption, or bucal or oral. 
Pharmaceutical^ acceptable carriers also include tetanus 
toxoid. 

Vaccine compositions of the invention are used for the 
treatment or prophylaxis of streptococcus infection and/or 
diseases and symptoms mediated by streptococcus infection 
as described in P.R. Murray (Ed, in chief), E.J. Baron, M.A. 
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Pfaller, F.C. Tenover and R.H. Yolken. Manual of Clinical 
Microbiology, ASM Press, Washington, D.C. sixth edition, 
1995, 1482p which are herein incorporated by reference. In 
one embodiment, vaccine compositions of the present 
invention are used for the treatment or prophylaxis of 
meningitis, otitis media, bacteremia or pneumonia. in one 
embodiment, vaccine compositions of the invention are used 
for the treatment or prophylaxis of streptococcus infection 
and/or diseases and symptoms mediated by streptococcus 
infection, in particular S.pneumorH group A 
streptococcus (pyogenes) , group B streptococcus (GBS or 
agalactiae) , dysgalactiae, uteris, nocardia as well as 
Staphylococcus aureus, in a further embodiment, the 
streptococcus infection is S. pneumoniae 

In a particular embodiment, vaccines are administered to 
those individuals at risk of streptococcus infection such 
as infants, elderly and immunocompromised individuals. 

As used in the present application, the term - individuals- 
include mammals. In a further embodiment, the mammal is 
human. 
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Vaccine compositions are preferably in unit dosage form of 
about 0.001 to 100 ng/kg (antigen/body weight) and more 
preferably 0.01 to 10 jig/kg and most preferably 0.1 to 1 
*ig/kg 1 to 3 times with an interval of about 1 to 6 week 
intervals between immunizations. 

According to another aspect, there are provided 
polynucleotides encoding polypeptides characterized by the 
amino' acid sequence chosen from SEQ id NOs: 2, 4, 6, 8, 10, 
14, 16, 55 to 75, 77 to 79, 81, 83 or fragments, analogs or 
derivatives thereof. 
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In one embodiment, polynucleotides are those illustrated in 
SEQ ID Nos: 1, 3 , 5, 7, 9. 11, 12. 13, 15, 76, 80, 82 

which may include the open reading frames (ORF) , encoding 
polypeptides of the invention, it will be appreciated that 
the polynucleotide sequences illustrated in the figures may 
be altered with degenerate codons yet still encode the 
polypeptides of the invention. Accordingly the present 
invention further provides polynucleotides which hybridize 
to the polynucleotide sequences herein above described (or 
the complement sequences thereof) having 50% identity 
between sequences. In one embodiment, at least 70% identity 
between sequences. In one embodiment, at least 75% identity 
between sequences, in one embodiment, at least 80% identity 
between sequences. In one embodiment, at least 85% identity 
between sequences. In one embodiment, at least 90% identity 
between sequences. In a further embodiment, polynucleotides 
are hybridizable under stringent conditions i.e. having at 
least 95% identity. In a further embodiment, more than 97% 
identity. 

In a further embodiment, polynucleotides are those 
illustrated in SEQ ID NOs : 1, 3, 7, 9, 11, 12, 13, 15, 76, 
80, 82 encoding polypeptides of the invention. 

In a further embodiment, polynucleotides are those 
illustrated in SEQ ID NOs : 1, 3, 9, 11, i 2 , 13, 15, 76, 
80, 82 which may include the open reading frames (ORF) , 
encoding polypeptides of the invention. 

In a further embodiment, polynucleotides are those 
illustrated in SEQ ID NOs , 1, 3, 9, 11, 12, 13, 15, 76 
which' may include the open reading frames (ORF) , encoding 
polypeptides of the invention. 

35 In a further embodiment, polynucleotides are those 
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illustrated in SEQ ID NOs : 1, 3, 7, 9, 11, 12, 13, 15, 76 

which may include the open reading frames (ORF) , encoding 
polypeptides of the invention. 

In a further embodiment, polynucleotides are those 
illustrated in SEQ ID NOs : 1, 7, 9, 11, 15, 76 which may 
include the open reading frames (ORF) , encoding 
polypeptides of the invention. 

In a. further embodiment, polynucleotides are those 
illustrated in SEQ ID NOs : 1, 9, 11, 15, 76 which may 
include the open reading frames (ORF) , encoding 
polypeptides of the invention. 

In a further embodiment, polynucleotides are those 
illustrated in SEQ ID NOs : 1, 7, 9, 11 which may include 

the open reading frames (ORF) , encoding polypeptides of the 
invention. 

In a further embodiment, polynucleotides are those 
illustrated in SEQ ID NO : 1, encoding polypeptides of the 
invention. 

In a further embodiment, polynucleotides are those 
illustrated in SEQ ID NO *7, encoding polypeptides of the 
invention. 

In a further embodiment, polynucleotides are those 
illustrated in SEQ ID NO t9, encoding polypeptides of the 
invention. 

i 

In a further embodiment, polynucleotides are those 
illustrated in SEQ ID NO ill, encoding polypeptides of the 
invention. 
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In a further embodiment, polynucleotides are those 
illustrated in SEQ ID NO :15, encoding polypeptides of the 
invention. 

In a further embodiment, polynucleotides are those 
illustrated in SEQ ID NOs : 3, 12, 13, 76, encoding 
polypeptides of the invention. 

In a further embodiment, polynucleotides are those 
illustrated in SEQ ID NO :3, encoding polypeptides of the 
invention . 

In a further embodiment, polynucleotides are those 
illustrated in SEQ ID NO :12, encoding polypeptides of the 
invention . 

In a further embodiment, polynucleotides are those 
illustrated in SEQ ID NO t!3, encoding polypeptides of the 
invention. 

In a further embodiment, polynucleotides are those 
illustrated in SEQ ID NO :76, encoding polypeptides of the 
invention. 

As will be readily appreciated by one skilled in the art, 
polynucleotides include both DNA and RNA. 

The present invention also includes polynucleotides 
complementary to the polynucleotides described in the 
present application. 

In a further aspect, polynucleotides encoding polypeptides 
of the invention, or fragments, analogs or derivatives 
thereof, may be used in a DNA immunization method. That 
is, they can be incorporated into a vector which is 
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replicable and expressible upon injection thereby producing 
the antigenic polypeptide in vivo. For example 
polynucleotides may be incorporated into a plasmid vector 
under the control of the CMV promoter which is functional 
in eukaryotic cells. Preferably the vector is injected 
intramuscularly. 

According to another aspect, there is provided a process 
for producing polypeptides of the invention by recombinant 
techniques by expressing a polynucleotide encoding said 
polypeptide in a host cell and recovering the expressed 
polypeptide product. Alternatively, the polypeptides can 
be produced according to established synthetic chemical 
techniques i.e. solution phase or solid phase synthesis of 
oligopeptides which are ligated to produce the full 
polypeptide (block ligation) . 

General methods for obtention and evaluation of 
polynucleotides and polypeptides are described in the 
following references: Sambrook et al, Molecular Cloning: A 
Laboratory Manual, 2nd ed, Cold Spring Harbor, N.Y., 1989; 
Current Protocols in Molecular Biology, Edited by Ausubel 
F.M. et al., John Wiley and Sons, Inc. New York; PCR 
Cloning Protocols, from Molecular Cloning to Genetic 
Engineering, Edited by White B.A., Humana Press, Totowa, 
New Jersey, 1997, 490 pages; Protein Purification, 
Principles and Practices,. Scopes R.K. , Springer-Verlag, New 
York, 3rd Edition, 1993, 380 pages; Current Protocols in 
Immunology, Edited by Coligan J.E. et al . , John Wiley & 
Sons Inc . , New York which are herein incorporated by 
reference. 

For recombinant production, host cells are transfected with 
vectors which encode the polypeptide, and then cultured in 
a nutrient media modified as appropriate for. activating 
promoters, selecting transf ormants or amplifying the genes. 
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Suitable vectors are those that are viable and replicable 
in the chosen host and include chromosomal, non- chromosomal 
and synthetic DNA sequences e.g. bacterial plasmids, phage 
DNA, baculovirus, yeast plasmids, vectors derived from 
combinations of plasmids and phage DNA. The polypeptide 
sequence may be incorporated in the vector at the 
appropriate site using restriction enzymes such that it is 
operably linked to an expression control region comprising 
a promoter, ribosome binding site (consensus region or 
Shine-Dalgarno sequence) , and optionally an operator 
(control element) . One can select individual components of 
the expression control region that are appropriate for a 
given host and vector according to established molecular 
biology principles (Sambrook et al, Molecular Cloning: A 
Laboratory Manual, 2nd ed, Cold Spring Harbor, N.Y. , 1989; 
Current Protocols in Molecular Biology, Edited by Ausubel 
F.M. et al., John Wiley and Sons, Inc. New York 
incorporated herein by reference) . Suitable promoters 
include but are not limited to LTR or SV40 promoter, E.coli 
lac, tac or trp promoters and the phage lambda P L promoter. 
Vectors will preferably incorporate an origin of 
replication as well as selection markers i.e. ampicilin 
resistance gene. Suitable bacterial vectors include pET, 
PQE70, pQE60, pQE-9, pbs, pDIO phagescript, psiX174, 
pbluescript SK, pbsks, pNH8A, pNH16a, pNH18A, pNH46A, 
ptrc99a, pKK223-3, pKK233-3, pDR540, pRIT5 and eukaryotic 
vectors pBlueBacIII, pWLNEO, pSV2CAT, pOG44, pXTl, pSG, 
pSVK3 , pBPV, pMSG and pSVL. Host cells may be bacterial 
i.e. E.coli, Bacillu s subtilis . Streptomvces : fungal i.e. 
Aspergillus nicrer. Aspergillus nidulins : yeast i.e. 
Sacchar omvces or eukaryotic i.e. CHO, COS. 

Upon expression of the polypeptide in culture, cells are 
typically harvested by centrifugation then disrupted by 
physical or chemical means (if the expressed polypeptide is 
not secreted into the media) and the resulting crude 
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.extract retained to isolate the polypeptide of interest. 
Purification of the polypeptide from culture media or 
lysate may be achieved by established techniques depending 
on the properties of the polypeptide i.e. using ammonium 
sulfate or ethanol precipitation , acid extraction, anion 
or cation exchange chromatography, phosphocellulose 
chromatography, hydrophobic interaction chromatography, 
hydroxylapatite chromatography and lectin chromatography. 
Final purification may be achieved using HPLC. 

The polypeptide may be expressed with or without a leader 
or secretion sequence. In the former case the leader may 
be removed using post-translational processing (see US 
4,431,739; US 4,425,437; and US 4,338,397 incorporated 
herein by reference) or be chemically removed subsequent to 
purifying the expressed polypeptide. 

According to a further aspect, the streptococcus 
polypeptides of the invention may be used in a diagnostic 
test for streptococcus infection, in particular S. 
pneumoniae infection. Several diagnostic methods are 
possible, for example detecting streptococcus organism in a 
biological sample, the following procedure may be followed: 

a) obtaining a biological sample from a patient; 

b) incubating an antibody or fragment thereof reactive 
with a streptococcus polypeptide of the invention with 
the biological sample to form a mixture; and 

c) detecting specifically bound antibody or bound 
fragment in the mixture which indicates the presence 
of streptococcus . 

Alternatively, a method for the detection of antibody 
specific to a streptococcus antigen in a biological sample 
containing or suspected of containing said antibody may be 
performed as follows: 

a) obtaining a biological sample from a patient; 
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b) incubating one or more streptococcus polypeptides of 
the invention or fragments thereof with the biological 
sample to form a mixture; and 

c) detecting specifically bound antigen or bound fragment 
in the mixture which indicates the presence of 
antibody specific to streptococcus. 

One of skill in the art will recognize that this diagnostic 
test may take several forms, including an immunological 
test such as an enzyme-linked immunosorbent assay (ELISA) , 
a radioimmunoassay or a latex agglutination assay, 
essentially to determine whether antibodies specific for 
the protein are present in an organism. 

15 The DNA sequences encoding polypeptides of the invention 
may also be used to design DNA probes for use in detecting 
the presence of streptococcus in a biological sample 
Suspected of containing such bacteria. The detection 
method of this invention comprises: 

a) obtaining the biological sample from a patient; 

b) incubating one or more DNA probes having a DNA 
sequence encoding a polypeptide of the invention or 
fragments thereof with the biological sample to form a 
mixture ; and 

25 c) detecting specifically bound DNA probe in the mixture 
which indicates the presence of streptococcus 
bacteria . 
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The DNA probes of this invention may also be used for 
detecting circulating streptococcus i.e. 

S.pneumoniae nucleic acids in a sample, for example using a 
polymerase chain reaction, as a method of diagnosing 
streptococcus infections. The probe may be synthesized 
using conventional techniques and may be immobilized on a 
solid phase, or may be labelled with a detectable label. A 
preferred DNA probe for this application is an oligomer 
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having a sequence complementary to at least about 6 
contiguous nucleotides of the streptococcus pneumoniae 
polypeptides of the invention. 

5 Another diagnostic method for the detection of 
streptococcus in a patient comprises: 

a) labelling an antibody reactive with a polypeptide of 
the invention or fragment thereof with a detectable 
label ; 

10 b) administering the labelled antibody or labelled 
fragment to the patient; and 
c) detecting specifically bound labelled antibody or 

labelled fragment in the patient which indicates the 
presence of streptococcus. 

15 

A further aspect of the invention is the use of the 
streptococcus polypeptides of the invention as immunogens 
for the production of specific antibodies for the diagnosis 
and in particular the treatment of streptococcus infection. 
20 Suitable antibodies may be determined using appropriate 

screening methods, for example by measuring the ability of 
a particular antibody to passively protect against 
streptococcus infection in a test model. One example of an 
animal model is the mouse model described in the examples 
25 herein. The antibody may be a whole antibody or an 

antigen-binding fragment thereof and may belong to any 
immunoglobulin class. The antibody or fragment may be of 
animal origin, specifically of mammalian origin and more 
specifically of murine, rat or human origin. It may be a 
natural antibody or a fragment thereof, or if desired, a 
recombinant antibody or antibody fragment. The term 
recombinant antibody or antibody fragment means antibody or 
antibody fragment which was produced using molecular 
biology techniques. The antibody or antibody fragments may 
be polyclonal, or preferably monoclonal. It may be. 
specific for a number of epitopes associated with the 
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streptococcus pneumoniae polypeptides but is preferably 
specific for one. 

Without limiting its scope, the present invention also 
5 relates to new antigens designated BVH-3, BVH-11, BVH-11-2, 
BVH-28 and BVH-71. The present invention also relates to 
truncated polypeptides comprising fragments of the new 
antigens designated BVH-3, BVH-11, BVH-11-2, BVH-28 and 
BVH-71. The present invention also relates to chimeric 
10 polypeptides comprising fragments of the new antigens 

designated BVH-3, BVH-11, BVH-11-2, BVH-28 and BVH-71. The 
following is a reference table summarizing the relation 
between the antigens of the present invention: 



Family 


Nucleotide SEQ id 
NO 


Polypeptide SEQ ID 
NO 


BVH-3 


BVH-3 


1, 11 


2 


BVH-3 A 


7 


8 


BVH-3B 


9 


10 


BVH-3 SP63 


15 


16 


BVH-3M 




55 


BVH-3 AD 




56 


L-BVH-3AD 




57 


Newl2 


76 


58 


BVH-3C 




59 


Newl 




64 


New2 




65 


New3 




66 


Newl 5 




78 


BVH-11 




BVH-11 


3, 12 


4 


BVH-11-2 


13 


14 


BVH-11M 




60 


BVH-11A 




61 


BVH-11B also 




62 


referred to as 




NEW13 






BVH-11C 




63 


New4 




67 


New5 


68 
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Family 


Nucleotide SEQ ID 


PolvDeiatide SEO ID 




NO 


NO 


New6 




69 


New7 




70 


New8 




71 


New9 




72 


BVH-11-2M 




73 


NewlO 




74 


Newll 




75 


Newl2 


76 


58 


Newl4 




77 


Newl6 




79 


BVH-28 


BVH-28 


5 


6 


BVH-71 


6BS 


80 


81 


GAS 


82 


83 



EXAMPLE 1 

This example illustrates the cloning of pneumoniae genes. 

The coding region of pneumoniae gene BVH-3 (SEQ ID NO: 1) 

and the coding region of pneumoniae gene BVH-28 (SEQ ID 

NO: 5) were amplified by PCR (DNA Thermal Cycler GeneAmp 

PCR system 2400 Perkin Elmer, San Jose, CA) from genomic DNA 

of serogroup 6 pneumoniae strain SP64 using the oligos 

that contained base extensions for the addition ,of 

restriction sites Bglll (AGATCT) and Xbal (TCTAGA) . PCR 

products were purified from agarose gel using a QIAquick gel 

extraction kit from QIAgen (Chatsworth, CA) , digested Bglll- 

Xbal (Pharmacia Canada Inc, Baie d'Urfe, Canada), extracted 

with phenol : chloroform and precipitated with ethanol. The 
i 

Superlinker vector pSL301 (Invitrogen, San Diego, CA) was 
digested with Bglll and Xbal and purified from agarose gel 
using a QIAquick gel extraction kit from QIAgen (Chatsworth, 
CA) . The Bglll-Xbal genomic DNA fragments were ligated to 
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the Bglll-Xbal pSL301 vector. The ligated products were 
transformed into coli strain DH5a [f80 lacZ DM15 endAl 
recAl hsdR17 ( r K" m K + ) supE44 thi-11' gyrA96 relAl D(lacZYA- 
argF)U169] (Gibco BRL, Gaithersburg, MD) according to the 
method of Simanis (Hanahan, D. DNA Cloning, 1985, D.M. 
Glover (ed) , pp. 109-135). Recombinant pSL301 plasmids 
(rpSL301) containing either BVH-3 or BVH-28 gene were 
purified using a QIAgen kit (Chatsworth, CA) and DNA inserts 
were confirmed by nucleotide sequence analysis (Taq Dye 
Deoxy Terminator Cycle Sequencing kit, ABI, Foster City, 
CA) . Recombinant rpSL301 (rpSL301) were digested with the 
restriction enzymes Bglll (AGATCT) and Xhol (CTCGAG) . DNA 
fragments Bglll-Xhol were purified using the QIAquick gel 
extraction kit from QIAgen (Chatsworth, CA) . pET-32c(+) 
expression vector (Novagen, Madison, WI) containing the 
thioredoxin-His-Tag sequence was digested with BamHI 
(GGATCC) and Xhol and gel extracted using the QIAquick gel 
extraction kit from QIAgen (Chatsworth, CA) . The Bglll-Xhol 
DNA fragments were ligated to the BamHI-XhoI pET-32c(+) 
vector to create the coding sequence for thioredoxin- 
His-Tag-BVH-3 or thioredoxin-His -Tag-BVH-28 fusion protein. 
The ligated products were transformed into E^ coli strain 
DH5a [f80 lacZ DM15 endAl recAl hsdR17 ( r K _m K + ) supE44 thi-11" 
gyrA96 relAl D(lacZYA-argF)U169] (Gibco BRL, Gaithersburg, 
MD) according to the method of Simanis (Hanahan, D. DNA 
Cloning, 1985, D.M. Glover (ed) , pp. 109-135). Recombinant 
pET-32c(+) plasmids were purified using a QIAgen kit 
(Chatsworth, CA) and the nucleotide sequences at the fusion 
sites of thioredoxin-His -Tag and DNA insert were verified by 
DNA sequencing (Taq Dye Deoxy Terminator Cycle Sequencing 
kit, ABI, Foster City, CA) . 
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EXAMPLE 2 



15 



This example illustrates the cloning of pneumoniae 
5 protein genes in CMV plasmid pCMV-GH. 

The DMA coding region of a S. pneumoniae protein was 
inserted in phase downstream of a human growth hormone (hGH) 
gene which was under the transcriptional control of the 
cytomegalavirus (CMV) promoter in the plasmid vector pCMV-GH 
(Tang et al., Nature, 1992, 356 :152) . The CMV promotor is 
non functional plasmid in E_. coll cells but active upon 
administration of the plasmid in eukaryotic cells. The 
vector also incorporated the ampicillin resistance gene. 



10 



20 



The coding region of BVH-3 gene (SEQ ID NO: 1) and BVH-28 
gene (SEQ ID NO: 5) were obtained from rpSL301 (see example 
1) using restriction enzymes Bglli (AGATCT) and Xbal 
(TCTAGA) . The digested products were purified from agarose 
gel using the QIAquick gel extraction kit from QlAgen 
(Chatsworth, CA) . The pCMV-GH vector (Laboratory of Dr. 
Stephen A. Johnston, Department of Biochemistry, The 
University of Texas, Dallas, Texas) containing the human 
growth hormone to create fusion proteins was digested with 
25 Bglll and Xbal and purified from agarose gel using the 

QIAquick gel extraction kit from QlAgen (Chatsworth, CA) . 
The Bglij-xbal DNA fragments were ligated to the Bglli-xbal 
PCMV-GH vector to create the hGH- BVH-3 or hGH-BVH-28 fusion 
protein under the control of the CMV promoter. The ligated 
products were transformed into E^ cpli strain DH5aff80 JacZ 
DM15 endAl recAl hsdR17 ( W) SU pE44 thi-11" gyrA96 relAl 
D(lac Z YA-argF)U169] (Gibco BRL, Gaithersburg, MD) according 
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to the method of Simanis (Hanahan, D. DNA Cloning, 1985, 
D.M. Glover (ed) , pp. 109-135). The recombinant pCMV 
plasmids were purified using a QIAgen kit (QIAgen, 
Chatsworth, CA) . 

The coding region of BVH-11 gene (SEQ ID NO: 3) was 
amplified by PCR (DNA Thermal Cycler GeneAmp PCR system 2400 
Perkin Elmer, San Jose, CA) from genomic DNA of serogroup 6 

pneumoniae strain SP64 using the oligos that contained 
base extensions for the addition of restriction sites Bglll 
(AGATCT) and HindHI (AAGCTT) . The PCR product was purified 
from agarose gel using a QIAquick gel extraction kit from 
QIAgen (Chatsworth, CA) , digested with restriction enzymes 
(Pharmacia Canada Inc, Baie d'Urfe, Canada), extracted with 
phenol : chloroform and precipitated with ethanol. The 
pCMV-GH vector (Laboratory of Dr. Stephen A. Johnston, 
Department of Biochemistry, The University of Texas, Dallas, 
Texas) was digested with Bglll and HindHI and purified from 
agarose gel using the QIAquick gel extraction kit from 
QIAgen (Chatsworth, CA) . The Bglll-Hindlll DNA fragment was 
ligated to the Bglll-Hindlll pCMV-GH vector to create the 
hGH-BVH-11 fusion protein under the control of the CMV 
promoter. The ligated products were transformed into EL coli 
strain DH5a[f80 lacZ DM15 endAl recAl hsdR17 ( r K _m K + ) supE44 
thi-ll" gyrA96 relAl D(IacZYA-argrF)U169] (Gibco BRL, 
Gaithersburg, MD) according to the method of Simanis 
(Hanahan, D. DNA Cloning, 1985, D.M. Glover (ed) , pp. 109- 
135) . The recombinant pCMV plasmid was purified using a 
QIAgen kit (Chatsworth, CA) and the nucleotide sequence of 
the DNA insert was verified by DNA sequencing. 
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EXAMPLE 3 



10 



This example illustrates the use of DNA to elicit an immune 
response to pneumoniae antigens. 

A group of 8 female BALB/c mice (Charles River, St-Constant, 
Quebec, Canada) were immunized by intramuscular injection of 
50 ul three times at two- or three-week intervals with 100 
fig of recombinant pCMV-GH encoding the BVH-3, BVH-11 or the 
BVH-28 gene in presence of 50 ug of granulocyte-macrophage 
colony-stimulating factor (GM-CSF) - expressing plasmid pCMV- 
GH-GM-CSF (Laboratory of Dr. Stephen A. Johnston, Department 
of Biochemistry, The University of Texas, Dallas, Texas) . 
As control, a group of mice were injected with 100 ug of 
15 pCMV-GH in presence of 50 Ug of pCMV-GH- GM-CSF . Blood 
samples were collected from the orbital prior to each 
immunization and seven days following the third injection 
and serum antibody responses were determined by ELISA using 
thioredoxin-His-Tag-S^ pneumoniae fusion protein as coating 
20 antigen. DNA immunization with recombinant plasmid pCMV-GH 
encoding the BVH-3, BVH-11 or the BVH-28 S^ pneumoniae 
protein induced antibody reactive against the respective 
recombinant protein. The reciprocal antibody titers, defined 
as the highest serum dilution at which the absorbance values 
15 were 0.1 above the background values, were above 4x1 0 3 . 



EXAMPLE 4 



30 This example illustrates the production and purification of 
recombinant S^ pneumoniae proteins. 
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The recombinant pET plasmids containing the BVH-3, BVH-11 or 
the BVH-28 gene corresponding to the SEQ ID NO: 1 , SEQ ID 
NO: 3 or the SEQ ID NO: 5 respectively were transformed by 
electroporation (Gene Pulser II apparatus, BIO-RAD Labs, 
Mississauga, Canada) into E^ coli strain AD494 (DE3) (Dara" 
leu7697 DlacX74 DphoA PvuIT phoR DmaJF3 F' [Iac + (Iacl q ) pro] 
trxB::Kan) (Novagen, Madison, WI) . In this strain of E^ 
coli, the T7 promotor controlling expression of the fusion 
protein is specifically recognized by the T7 RNA polymerase 
(present on the 1DE3 prophage) whose gene is under the 
control of the lac promotor which is inducible by isopropyl- 
B-d-thio-galactopyranoside (IPTG) . The transformant 
AD494(DE3)/rpET was grown at 37°C with agitation at 250 rpm 
in LB broth (peptone lOg/L, yeast extract 5g/L, NaCl lOg/L) 
containing 100|Lig of ampicillin ( Sigma -Aldrich Canada Ltd., 
Oakville, Canada) per ml until the Aeoo reached a value of 
0.6. In order to induce the production of the thioredoxin- 
His * Tag-BVH-3 , thioredoxin-His -Tag-BVH-11 or thioredoxin- 
His-Tag-BVH-28 fusion protein, the cells were incubated for 
2 additional hours in the presence of IPTG at a final 
concentration of 1 mM. Induced cells from a 100 ml culture 
were pelleted by centrifugation and frozen at -70°C. 

The purification of the fusion proteins from the soluble 
cytoplasmic fraction of IPTG- induced AD494 (DE3) /rpET was 
done by affinity chromatography based on the properties of 
the His -Tag sequence (6 consecutive histidine residues) to 
bind to divalent cations (Ni 2+ ) immobilized on the His -Bind 
metal chelation resin. Briefly, the pelleted cells obtained 
from a lOOmL culture induced with IPTG were resuspended in 
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phosphate-buffered (PBS) : 500mM NaCl pH7.1, sonicated and 
spun at 20,000 X g for 20 min to remove debris. The 
supernatant was filtered (0.22um pore size membrane) and 
deposited on a HiTrap® lmL chelating pre-packed ready-to-us 
column (Pharmacia Biotech, Baie d'Urfe, Canada) . The 
thioredoxin-His-Tag-S^ pneumoniae fusion protein was eluted 
with 1M imidazole- 5 OOmM NaCl -PBS pH7.1. The removal of the 
salt and imidazole from the sample was done by dialysis 
against PBS at 4°C. The quantities of fusion protein 
obtained from the soluble fraction of E^ coli was estimated 
by MicroBCA (Pierce, Rockford, Illinois) . 



EXAMPLE 5 

15 



This example illustrates the protection of mice against 
fatal pneumococcal infection by immunization. 
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Groups of 8 female BALB/c mice (Charles River) were 
immunized subcutaneously three times at three-week intervals 
with either 25 »g of affinity purified thioredoxin-His-Tag- 
BVH-3 fusion protein in presence of 15 ng of QuilA adjuvant 
(Cedarlane Laboratories Ltd, Hornby, Canada) or, as control, 
with QuilA adjuvant alone in PBS. Blood samples were 
collected from the orbital sinus on day 1, 22 and 43 prior 
to each immunization and seven days (day 50) following the 
third injection. Cine week later the mice were challenged 
with approximately 10 6 CFU of the type 3 S^ pneumoniae 
strain WU2 . Samples of the S^ pneumoniae challenge inoculum 
were plated on chocolate agar plates to determine the CFU 
and to verify the challenge dose. Deaths were recorded for 
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a period of 14 days and on day 14 post-challenge, the 
surviving mice were sacrificed and blood samples tested for 
the presence of pneumoniae organisms . The survival data 
are shown in table 1. 



10 



Prechallenge sera were analyzed for the presence of 
antibodies reactive with pneumoniae by standard 
immunoassays. Elisa and immunoblot analyses indicated that 
immunization with recombinant pneumoniae protein produced 
in EL coli elicited antibodies reactive with both, 
recombinant and native pneumococcal protein. 

Table 1. Protection mediated by recombinant BVH-3 protein 



Immunogen 


No. of mice alive : no. of mice 
dead 

14 days post -challenge 


Median day of 
death 


BVH-3 


8 : 0 


>14 


none 


0:8 


1 



15 



20 



All mice immunized with BVH-3 recombinant protein survived 
to infection while none of the control mice given adjuvant 
alone survived. There was a significant difference in 
survival between the two groups of mice (P<0.0001, log rank 
test for nonparametric analysis of survival curves; 
P=0.0002, Fisher's exact test). All hemocultures from 
surviving mice were negative at day 14 post-challenge. 
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This example describes the cloning of BVH-3 and BVH-11 
genes from a variety of S^. pneumoniae strains and the 
molecular conservation of these genes. 

5 Molecular analysis of chromosomal DNA from various 

pneumoniae isolates with DNA probes spanning different 
regions of BVH-3 or BVH-11 revealed the presence of one 
BVH-3 ^ene copy and two BVH-11 gene copies. The two BVH-11 
gene copies are not identical and the genes were 
10 arbitrarily designated BVH-11 (SEQ ID NO: 12; ORF at 

nucleotides 45 to 2567) and BVH-11-2 (SEQ ID NO: 13; ORF at 
nucleotides 114 to 2630) . 
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The first amino acids of the BVH-3 and BVH-11 coding 
regions have the characteristics of leader sequences also 
known as signal peptides. The consensus signal peptidase 
cleavage site L-X-X-C of lipoprotein 

modification/processing sites was present in the sequences. 
Mature BVH-3, BVH-11 and BVH-11-2 proteins from ^ 
pneumoniae SP64 have 1019, 821 and 819 amino acids, 
respectively. The regions of pneumoniae? genes coding 
for mature BVH-3, termed BVH-3M, (nucleotides 1837 - 4896; 
SEQ. ID. NO: 11), BVH-llM (nucleotides 102-2567; SEQ. ID. 
NO: 12) and BVH-11-2M (nucleotides 171-2630; SEQ. ID. NO: 
25 13) , were amplified by PCR(DNA Thermal Cycler GeneAmp PCR 
system 2400 Perkin Elmer, San Jose, CA) from genomic DNA of 
6 or 7 pneumoniae strains. Serogroup 6 S^. pneumoniae 
SP64 and serogroup 9 SP63 clinical isolates were provided 
by the laboratoire de la sante publique du Quebec, Sainte- 
Anne-de-Bellevue; serotype 4 strain JNR.7/87 was provided 
by Andrew Camilli, Tufts University School of Medicine, 
Boston; Rxl strain, a nonencapsulated derivative of the 
type 2 strain D39 and the type 3 strains A66 and WU2 were 
provided by David E. Briles from University of Alabama, 
Birmingham and the type 3 clinical isolate P4241 was 
provided by the centre de recherche en infectiologie du 
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centre hospitalier de 1' university Laval, Sainte-Foy. The 
sets of oligonucleotide primers OCRR479-OCRR480; HAMJ160- 
OCRR488 and HAMJ160-HAMJ186 , that contained base extensions 
for the addition of restriction sites were used for the 
amplification of BVH-3 . BVH-11 and BVH-11-2 gene, 
respectively, with the exception of BVH-11 gene from SP64 
strain which was amplified using the set of primers 
consisting of HAMJ487 and OCRR488. Primer sequences are 
listed below (Table 2). PCR products were purified from 
agarose gel using a QIAquick gel extraction kit from QiAgen 
(Chatsworth, CA) and digested Bglll-Xbal or Bglll-Hindlll 
(Pharmacia Canada Inc, Baie d'Urfe, Canada) . Digestions 
were cleaned using a QIAquick PCR purification kit from 
QIAgen (Chatsworth, CA) . The PCR products were ligated to 
15 the Bglll-Xbal or Bglll-Hindlll pSL301 vector. The ligated 
products were transformed into E. coli strain DH5cc [<J>80 
lacZ AM15 endAl recAl hsdR17 ( t K m K t ) supE44 thi-lX" gyrh96 
relAl A(IacZYA-argrF)U169] (Gibco BRL, Gaithersburg, MD) 
according to the method of Simanis (Hanahan, D. DNA 
Cloning, 1985, D.M. Glover (ed) , pp. 109-135). Recombinant 
PSL301 plasmids (rpSL301) containing BVH-3 . BVH-11 or 
BVH11-2 were purified using a QIAgen kit (Chatsworth, CA) 
and DNA inserts were sequenced (Taq Dye Deoxy Terminator 
Cycle Sequencing kit, ABI, Foster City, CA) . The figures 11 
and 12 depict the consensus sequence established from the 
BVH ~ 3 ' and BVH-11 deduced amino acid sequences, 
respectively. Comparison of BVH-3 protein sequences 
revealed 99 to 100% identity of sequences for all strains 
with the exception that BVH-3 from serogroup 9 SP63 strain 
30 (SEQ. ID. NO: 15 and SEQ. ID. NO: 16) misses a stretch of 
177 amino acids corresponding to residues 244 to 42C on 
BVH-3 'protein sequence of S^. pneumoniae SP64 . Analysis of 
sequences of additional serogroup 9 strains revealed BVH-3 
molecule having the same deletion in 3 out of 4 strains 
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thus suggesting that the 3 strains are members of a S... 
pneumoniae serogroup 9 clone. 

Comparison of 13 BVH-11 nucleotide sequences obtained from 
7 £L_ pneumoniae strains, revealed that the nucleotide 
sequences are very similar. Computer analysis (MacVector, 
Clustal W 1.4) using multiple alignment of the predicted 
BVH-11 protein sequences revealed that these sequences were 
75% identical and 82 % homologous on a length of 834 amino 
acids. Pairwise alignment revealed 80 to 100% identity 
(Figure 13) . The sequences showed great similarity in 
overall organization. Variability in the primary sequence 
of these proteins is almost restricted to the last 125 
amino acids in the C- terminal portion of the proteins. This 
region constitutes a domain. Close examination of this 
domain revealed two groups of sequences. The first 9 
sequences from the figure 13 belong to one group while the 
last 4 sequences belong to another group. A 39% identity 
value is obtained when the domain sequences of the 13 
proteins are compared (MacVector, Clustal W 1.4). The 
identity value increased to more than 92% when sequences 
belonging to a same group are compared. 

EXAMPLE 7 

This example illustrates the homology of portions of BVH-3 
and BVH-11 genes. 

Molecular analysis with DNA probes derived from BVH-3 and 
BVH-11 genes indicated that BVH-3 and BVH-11 were related. 
In dot blot hybridization studies, DNA probe consisting of 
either, BVH-3 or BVH-11, gene sequence hybridized to both, 
BVH " 3 and BVH-11 genes thus indicating that BVH-3 and BVH- 
11 genes shared homologous sequences. Comparison of 
sequences revealed that the ORFs and the proteins were 43 
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and 33% identical, respectively. Closer examination 
revealed that the region corresponding to amino acids 1 to 
225 in BVH-3 and 1 to 228 in BVH-11 were 73 and 75% 
identical at the DNA and protein level, respectively. In 
contrast, the 3' regions corresponding to amino acids 226 
to 1039 from BVH-3 and amino acids 229-840 from BVH-11 were 
only 34 and 22% identical at the DNA and protein level, 
respectively. Thus the 5' termini of BVH-3 and BVH-11 
genes appear to contain highly conserved sequences while 
the remaining parts of the genes are highly divergent. 
These results suggest that BVH-3 and BVH-11 might share 
similar functions mediated by sequences present in the 
conserved region whereas BVH-3- and BVH-ll-specif ic 
functions might be mediated by sequences in the divergent 
region. 



EXAMPLE 8 

This example describes the cloning of truncated BVH-3 . BVH- 
11 and BVH-11-2 genes by polymerase chain reaction (PCR) 
and the expression of truncated BVH-3 and BVH-11 molecules. 

Gene fragments were amplified by PCR using pairs of 
oligonucleotide engineered to amplify fragments spanning 
the BVH-3 (SEQ ID NO: 1 and SEQ ID NO: 11) , BVH-11 (SEQ ID 
NO* 3 and SEQ ID NO* 12) or BVH-11-2 (SEQ ID NO: 13) gene 
from iL_ pneumoniae strain SP64. Each of the primers had a 
restriction endonuclease site at the 5' end, thereby 
allowing directional in- frame cloning of the amplified 
product into the digested plasmid vector (Tables 2 and 3) . 
PCR-amplified products were digested with restriction 
endonucleases and ligated to either linearized plasmid 
PSL301 (see example 1), pCMV-GH (see example 2) or pET 
(Novagen, Madison, WI) expression vector digested likewise 
or digested with enzymes that produce compatible cohesive 
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ends. Recombinant pSL301 and recombinant pCMV-GH. plasmids 
were digested with restriction enzymes for the in-frame 
cloning in pET expression vector. Clones were first 
stabilized in IL_ coli DH5a before introduction into coli 
5 BL2KXDE3) or AD494 (XDE3) for expression of truncated BVH- 
3 or BVH-11 molecules. Each of the resultant plasmid 
constructs was confirmed by nucleotide sequence analysis. 
The recombinant proteins were expressed as N- terminal 
fusions with the thioredoxin and His-tag or as C-terminal 

10 fusions with an His-tag. The expressed recombinant 

proteins were purified from supernatant fractions obtained 
from centrifugation of sonicated IPT6- induced EL coli 
cultures using a His-Bind metal chelation resin (QIAgen, 
Chatsworth, CA) . The gene products generated are listed in 

15 the table 3. The gene products corresponding to the N- 
terminal region including the signal sequence are 
designated as Lipidated-proteins or lipoproteins (L- 
proteins) . The gene products corresponding to the N- 
terminal region lacking the signal sequence are identified 

20 as protein without signal sequence (w/o ss) . 

Table 2. List of PCR oligonucleotide primers 

25 



Primer 


SEQ. 
ID. 


Sequence 5' - 3' 


Nucleotide 
position 


Restric- 
tion 
sites 


OCRR 479 


17 


cagtagatctgtgcctatgcactaaac 


SEQ ID 1 :61- 
78 


BglH 


OCRR 480 


18 


gatctctagactactgctattccttacgctatg 


SEQ 

ID 11 :4909- 
4887 


Xbal 


OCRR 497 

f 


19 


atcactcgagcattacctggataatcctgt 


SEQ 

ID 1 :1525- 
1506 


Xhol 


OCRR 498 


20 


ctgctaagcttatgaaagatttagat 


SEQ 

ID 1:1534- 
1548 


HindEI 



48 



WO00/39299 PCT/CA99/01218 



OCRR499 



HAMJ 172 



21 



22 



gatactcgagctgctattccttac 



SEQ 

ID11 :4906- 
4893 



Xhol 



gaatctcgagttaagctgctgctaattc 



SEQ ID 1 
675-661 



Xhol 



HAMJ 247 



23 



gacgctcgagcgctatgaaatcagataaattc 



SEQJD 
1:3117-3096 



Xhol 



HAMJ 248 



24 



gacgctcgagggcattacctggataatcctgttcatg 



SEQ ID 
1 :1527-1501 



Xhol 



HAMJ 249 



25 



cagtagatctcttcatcatttattgaaaagagg 



| SEQ ID 11 : 
1749-1771 



Bglll 



HAMJ 278 



26 



ttatttcttccatatggacttgacagaagagcaaattaag 



HAMJ 279 



27 



I SEQ ID 
1 : 1414-1437 



Ndel 



cgccaagcttcgctatgaaatcagataaattc 



SEQ ID 
1 1 :31 17-3096 



iHindlll 



HAMJ 280 



28 



cgccaagcttttccacaatataagtcgattgatt 



SEQ ID 
1 1 :2400-2377 



HindJJJ 



HAMJ 281 



29 



tlatttcttccatatggaagtacctatcttggaaaaagaa 



SEQ ID 
' 1 :2398-2421 



Ndel 



HAMJ 300 



30 



ttatttcttccatatggtgcctatgcactaaaccagc 



I SEQ ID 1 :62- 
82 



Ndel 



HAMJ 313 



31 



ataagaatgcggccgcttccacaatataagtcgattgatt 



OCRR487 



32 



OCRR488 



33 



I SEQ ID 
1 :2400-2377 



NotI 



cagtagatctgtgcttatgaactaggtttgc 



SEQ JD 3 :58- 
79 



BglH 



gatcaagcttgctgctacctttacttactctc 



I SEQ ID 
12 :2577-2556 



iHindlH 



HAMJ 171 



34 



ctgagatatccgttatcgttcaaacc 



SEQ ID 
3:1060-1075 



EcoRV 



HAMJ 251 



35 



ctgcaagcttttaaaggggaataatacg 



SEQ ID 
3:1059-1045 



HindUJ 



HAMJ 264 



36 



cagtagatctgcagaagccttcctatctg 



SEQ ID 3:682- iBglJJ 
700 



HAMJ 282 



37 



tcgccaagcttogttatcgttcaaaccattggg 



HAMJ 283 



SEQ ID 
3:1060-1081 



HindJH 



38 



ataagaatgcggccgccttactctcctttaataaagccaat j SEQ JD 
HP 1 3 .2520-2492 



Ndel 



HAMJ 284 



39 



catgccatggacattgatagtctcttgaaacagc 



SEQ JD 3:856- |NcoI 
880 



HAMJ 285 



40 



cgccaagcttcttactctcctttaataaagccaatag 



SEQ ID 
3:2520-2494 



iHindJJI 



HAMJ 286 



41 



cgacaagcttaacatggtcgctagcgttacc 



SEQ ID 
3:2139-2119 



I HindDI 



HAMJ 287 



42 



cataccatgggcctttatgaggcacctaag 



SEQJD 
3:2014-2034 



INcoI 



HAMJ 288 



43 



cgacaagcttaagtaaatcttcagcctctctcag 



SEQ ID 

3:2376-2353 



iHindffl 



49 
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HAMJ289 


44 


gataccatggctagcgaccatgttcaaagaa 


SEQ ID 

3 :2125-2146 


Ncol 


HAMJ290 


45 


cgccaagcttatcatccactaacttgactttatcac 


SEQJD 

3 : 1533-1508 


HindJH 


HAMJ291 


46 


cataccatggatattcttgccttcttagctccg 


SEQJD 

3 :1531-1554 


Ncol 


HAMJ301 


47 


catgccatggtgcttatgaactaggtttgc 


SEQ ID 3 :59- 
79 


Ncol 


HAMJ302 


48 


cgccaagctttagcgttaccaaaaccattatc 


SEQJD 

3 :2128-2107 


Hindlll 


HAMJ 160 


49 


gtattagatctgttcctatgaacttggtcgtcacca 


SEQ ID 13 : 
172-196 


BglU 


HAMJ186 


50 


cgcctctagactactgtataggagccgg 


SEQJD 13: 
2460-2443 


Xbal 


HAMJ 292 


51 


catgccatggaaaacatttcaagccttttacgtg 


SEQJD 11: 
754-778 


Ncol 


HAMJ 293 


52 


cgacaagcttctgtataggagccggttgactttc 


SEQJD 11 : 
2457-2434 


Hindm 


HAMJ 294 


53 


catgccatggttcgtaaaaataaggcagaccaag 


SEQJD 11 : 
2038-2062 


Ncol 


HAMJ 297 


54 


catgccatggaagcctattggaatgggaag 


SEQJD 11 : 
622-642 


Ncol 



I 
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EXAMPLE 9 

This example describes the isolation of monoclonal 
antibodies (Mabs) and the use of Mabs to characterize BVH- 
5 3, BVH-11 and BVH-11-2 protein epitopes. 

Female BALB/c mice (Charles River) were immunized 
subcutaneously with BVH-3 . BVH-11 or BVH-11-2 gene products 
from S^. pneumoniae strain SP64 in presence of 15 ng of 
QuilA adjuvant (Cedar lane Laboratories Ltd, Hornby, 
Canada). One set of mice (fusion experiment 1) were 
immunized on day 1 and 14 with 25 ng of affinity purified 
thioredoxin-His»Tag-BVH-3M fusion protein. A second group 
of mice (fusion experiment 2) were immunized three times at 
three-week intervals with 25 ug of affinity purified 
thioredoxin-His«Tag-BVH-llM. A third group of mice (fusion 
experiment 3) were immunized on day 1 and day 15 with 25 ug 
of affinity purified thioredoxin-His»Tag-BVH-ll-2M fusion 
protein. A fourth group of mice (fusion experiment 4) were 
immunized on day 1 with 25 iig of affinity purified 
thioredoxin-His«BVH-HB fusion protein and boosted by 
intravenous injection on day 16 and on day 37 with 
recombinant BVH-11B in PBS . Three to four days before 
fusion, mice were injected intravenously with 25 fig of the 
respective antigen suspended in PBS alone. Hybridomas were 
produced by fusion of spleen cells with nonsecreting SP2/0 
myeloma cells as previously described by J. Hamel et al. 
[J. Med. Microbiol., 23, ppl63-170 (1987)]. Culture 
supernatants of hybridomas were initially screened by 
enzyme- linked- immunoassay according to the procedure 
described by Hamel et al. (Supra) using plates coated with 
preparations of purified recombinant proteins or 
suspensions of heat-killed S^. pneumoniae cells. Positive 
hybridomas selected on the basis of ELISA reactivity with a 
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variety of antigens were then cloned by limiting . dilutions, 
expanded and frozen. 



10 



15 



Hybridomas were tested by ELISA or Western immunoblotting 
against BVH-3 and BVH-11 gene products in order to 
characterize the epitopes recognized by the Mabs . BVH-3 
and BVH-11 shared common epitopes with 6 Mabs (H3-1-F9, H3- 
1-D4, H3-1-H12, H11-1-E7, H11-1-H10 and Hll-1 . 1-G11) 
showing reactivities with both proteins (Table 4) . BVH-11 
and BVH-11-2 molecules from SL pneumoniae SP64 shared 
common epitopes not present on BVH-3 with Mabs (3A1, 13C11, 
10H10, 1D8, 10G9, 10A2, 3E8, 10D7, 2H7 and 6H7) reactive 
with both, BVH-11 and BVH-11-2, recombinant proteins (Table 
5) . 

Table 4. Reactivity of BVH-3 -immunoreactive Mabs with a 
panel of BVH-3 and BVH-11 gene products 



MAbs 


la. Iimnun 


oreactivity with 


BVH-3M 
21-1039 


BVH-3 A 
21-509 


BVH-3B 
512-1039 


BVH-3C 
21-225 


NEW2 
472-800 


NEW3 
800-1039 


BVH-11M 
20-840 


H3-1-F9 


+ 


+ 




+ 






+ 


H3-1-D4 


+ 


+ 




+ 






+ 


H3-1-H12 


+ 


+ 




+ 






+ 


H3-2-G2 


+ 


+ 












H3-3-A1 


+ 


+ 












H3-4-D3 


+ 




+ 






+ 




H11-1-E7 


+ 


+ 




+ 






+ 


Hll-1- 
H10 


+ 


+ 




+ 






+ 


Hll- 
1.1-611 


+ 
i 


+ 




+ 


+ 




+ 
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Table 5. Reactivity of Mabs raised against BVH-11-2 
protein from S^. pneumoniae strain SP64 with a panel of BVH- 
11 gene products 



Mabs* 


b.Immunoreactivity with 


C . DVri— 


11 proc 


iucts |d. BVH-11-2 products 


BVH-11M 
20-840 


NEW8 
286-511 


511-713 


BVH-11B 
354-840 


jj BVH-11-2 
20-838 


NEW10 
271-838 


NEW11 
699-838 


NEW14 
227-699 


3A1 


+ 


+ 





+ 


+ 






+ 


13C1 


+ 


+ 


+ 


+ 


+ 






+ 


10H10 


+ 


+ 


+ 




+ 






+ 


1D8 


+ 


+ 


- 


+ 


+ 


+ 




+ 


10G9 


+ 


- 




+ 


+ 


+ 




+ 


10A2 


+ 






+ 


+ 


+ 




+ 


3E8 


+ 






+ 


+ 


+ 




+ 


10D7 


+ 






+ 


+ 


+ 




+ 


2H7 


+ 








+ 








6H7 


+ 








+ 








3A4 










+ 


+ 


+ 




14H6 










+ 


+ 






7G2 










+ 


+ 




+ 


13H10 










+ 






+ 


7E8 










+ 








7H6 










+ 
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15 



The results obtained from the immunoreactivity studies of 
the Mabs (Table 4 and Table 5) are in agreement with the 
protein sequences derived from the respective gene 
sequences. Indeed the Mabs cross -reactive with BVH-3 and 
BVH-11 molecules recognized BVH-3C protein corresponding to 
the conserved region, and BVH-11 and BVH-11-2 specific Mabs 
were reactive with epitopes located on variable parts of 
these molecules. BVH-3 and BVH-11, and BVH-11 and BVH-11-2 
can be distinguished by their reactivity with Mabs. 



EXAMPLE 10 
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This example illustrates the simultaneous expression of 
BVH-3 and BVH-11 gene products by pneumoniae . 

A standard Western blot technique was used to investigate 
whether BVH-3 and BVH-11 genes were expressed in S^. 
pneumoniae . S^ pneumoniae strain SP64 and SP63 were grown 
overnight at 37°C in 5% CO, on chocolate agar plates, 
bacteria were suspended in PBS and heat-killed at 56°C for 
20 min. For the preparation of antigens, suspensions of S^. 
pneumoniae were treated with sample buffer containing SDS 
and 2-mercaptoethanol for 5 min at 100°C. Pneumococcal 
protein antigens were resolved by SDS-PAGE electrophoresis 
according to the method of Laemmli [Nature, 227, pp. 680- 
685 (1970)]. After SDS-PAGE, the proteins were transferred 
electrophoretically from the gel to nitrocellulose paper by 
the method of Towbin [Proc. Natl. Acad. Sci. USA, 76, pp. 
4350-4354 (1979)] and probed with mouse antiserum or 
monoclonal antibodies. The detection of antigens reactive 
with the antibodies was performed by indirect enzyme- 
immunoassay using conjugated-anti-mouse immunoglobulins and 
a colour substrate. When antiserum raised to recombinant 
BVH-3 was tested against S^ pneumoniae SP64 antigens, two 
reactive bands having apparent molecular masses of 127 kDa 
and 99 kDa were detected. Bands having the same apparent 
molecular masses were also detected when Mabs H3-1-F9, H3- 
1-D4, H3-1-H12, H11-1-E7, H11-1-H10 and Hll-1 . 1-G11 were 
used individually as immunological probes. In contrast, 
Mabs specific for the BVH-3 molecule detected the 127 kDa 
band only and Mabs specific for BVH-11 detected the 99 kDa 
band only thus confirming the identity of the 127 and 99 
kDa bands as BVH-3 and BVH-11, respectively. These studies 
provide evidence that BVH-3 and BVH-11 proteins are 
simultaneously present on pneumoniae Moreover, the 
results are consistent with our previous observations that 
35 BVH-3 and BVH-11 possess epitopes that are common to both 
proteins and epitopes that are exclusive to either protein. 
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In L pneumoniae SP64, mature BVH-3, BVH-11 and BVH-11-2 
are proteins of 1019, 821 and 819 amino acids with 
predicted molecular mass of 112.5 kDa, 92.4 kDa, and 91.7 
kDa, respectively. Although there is a discrepancy between 
the molecular mass predicted from the sequence and the 
molecular mass calculated on SDS-PAGE, BVH-3 can be 
distinguished from BVH-11 by its higher molecular mass. 
Moreover, BVH-3 molecules from S^ pneumoniae strain SP63 
have an apparent molecular mass of 112 kDa in SDS-PAGE 
compared to 127 kDa for BVH-3 of SP64 strain. This data is 
consistent with the deletion of a stretch of 177 amino acid 
residues in BVH-3 of S^. pneumoniae strain SP63 . 



EXAMPLE 11 

This example describes the protection conferred in 
experimental infection of mice vaccinated with recombinant 
BVH-3 or BVH-11 gene products. 

Groups of 7 or 8 female BALB/c mice (Charles River) were 
immunized subcutaneous ly three times at three-week 
intervals with either affinity purified thioredoxin- 
His»Tag-BVH-3M fusion protein, affinity purified 
thioredoxin-His.Tag-BVH-llM fusion protein or, as control, 
with QuilA adjuvant alone in PBS. Twelve to 14 days 
following the third immunization, the mice were challenged 
intravenously with ^ pneumoniae WU2 strain or intranasally 
with P4241 strain. Samples of the pneumoniae challenge 
inoculum were plated on chocolate agar plates to determine 
the CFU and to verify the challenge dose. The challenge 
dose was approximately 10 6 CFU. Deaths were recorded for a 
period of 14 days and on day 14 post-challenge, the 
surviving mice were sacrificed and blood samples tested for 
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the presence of pneumoniae organisms. The survival data 
are shown in Tables 6 and 7 . 



Table 6. Protection mediated by recombinant BVH-3M and 
BVH-11M proteins in experimental infection with virulent S, 
pneumoniae WU2 



Experiment 


Immunogen 


Alive : dead" 


Median days alive 


1 


BVH-3M 
none 


8 : 0 
0 : 8 


>14 
1 


2 


BVH-11M 
none 


8 : 0 
0 : 8 


>14 
1 



14 post-challenge, 



the number of mice dead on day- 



Table 7. Protection mediated by recombinant BVH-3M and 
BVH-11M proteins in experimental pneumonia with virulent S, 
pneumoniae P4241 



Experiment 


Immunogen 


Alive : 


dead" 


Median day alive 


1 


BVH-3M 


6 


: 1 




>14 




none 


1 


7 




4.5 


2 


BVH-3M 


8 : 


0 




>14 




BVH-11M 


8 : 


0 




>14 


a ml% A i 


none 


0 : 


8 




4 



14 post-challenge. 



the number of mice dead on day 



All mice immunized with recombinant BVH-3M or BVH-11M 
protein survived to infection with WU2 while none of the 
control mice given adjuvant alone survived. All except one 
mice immunized with recombinant BVH-3M or BVH-11M protein 
survived to infection with P4241 while only one control 
mice given adjuvant alone survived. All hemocultures from 
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surviving mice were negative at day 14 post-challenge. 
These results clearly indicate that both, BVH-3M and BVH- 
11M, elicit protective ant i -pneumococcal immune responses 
in mice. The fact that these proteins are highly conserved 
among pneumoniae isolates emphasize the potential of 
BVH-3 and BVH-11 as universal vaccine candidates. Indeed, 
the BVH-3 and BVH-11 proteins from serogroup 6 S. 
pneumoniae strain SP64 elicited protection against 
pneumococcal infections with strains of different capsular 
serotypes . 

Ideally, a vaccine that could protect against pneumococcal 
disease, could protect against meningitis, otitis media, 
bacteremia and pneumonia. BVH-3 and BVH-11 were protective 
against lethal systemic- and pneumonia-infection models 
thus suggesting that, in humans, BVH-3- and BVHll-protein- 
based vaccines could reduce the incidence of a wide 
spectrum of disease caused by virtually all pneumoniae 
independently of the capsular serotype. 

Data from Tables 6 and 7 clearly demonstrate that BVH-3 and 
BVH-11 were, both, protection-eliciting molecules of S. 
pneumoniae . It was not known, however, whether protection 
can be mediated by specific sequences that were not shared 
on BVH-3 and BVH-11 molecules. Groups of female BALB/c 
mice (Charles River) were immunized subcutaneously three 
times at three-week intervals with either affinity purified 
thioredoxin-His»Tag- BVH-3 AD, -BVH-3B or -BVH-3C fusion 
protein in presence of 15 /xg of QuilA adjuvant (Cedarlane 
Laboratories Ltd, Hornby, Canada) . Control mice were 
immunized with QuilA adjuvant alone in PBS or affinity 
purified thioredoxin-His*Tag or thioredoxin-His^Tag- fusion 
protein (His-Thio) in presence of QuilA. 

To determine the protective ability of a set of truncated 
proteins, termed NEW4, NEWS, NEW6, NEW7, NEW8, NEW9, NEW10, 
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NEW11, NEW14 and BVH-11B, groups of female BALB/c mice 
(Charles River) were immunized subcutaneously two times at 
three-week intervals with 25 ng of either affinity purified 
His«Tag- fusion protein in presence of 15 ng of QuilA 
adjuvant. Ten to 14 days following the last immunization, 
the mice were challenged with virulent S^. pneumoniae . Our 
results indicate that, BVH-3B, a truncated BVH-3 molecule 
consisting of amino acids 512-1039, elicited protection 
against the mouse-virulent strains WU2 and P4241. 
Similarly, BVH-11B, NEW4 and NEW5 molecules, three 
truncated BVH-11 molecules consisting of amino acids 354- 
840, amino acids 286-840 and amino acids 286-713, 
respectively, elicited protection against experiment 
intravenous challenge with WU2 and intranasal challenge 
with P4241. Moreover, vaccination with NEW10 and NEW14, 
consisting of amino acids 272-838 and amino acids 227-699 
from BVH-11-2 molecule also resulted in protection against 
death with the pneumococcal strains. These results 
indicate that the region comprising 428 amino acids 
extending from amino acids 286-713 and amino acids 272-699 
on S^. pneumoniae SP64 BVH-11 and BVH-11-2 protein 
sequences, respectively, contains protective epitopes. 
This region is highly conserved with a global 91% identity 
and 94% homology among thirteen BVH-11 protein sequences. 
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Table 8. Evaluation of protection elicited by vaccination 
of mice with BVH-3 and BVH-11 gene products 







Challenge with WU2 


| Challenge withP4241 


Experiment 

„ b "'. 


Immunogen 


Alive : dead 3 


Median day 
alive 


1 Alive : dead 


Median day 
alive 


1 


None 


0 : 8 


1.5 


1 ; V 


4.5 




NEW4 


8 : 0 


>14 


8 : 0 


>14 




NEWS 


8 : 0 


>14 


8 : 0 


>14 




NEW7 


0 : 8 


2 


0 : 8 


5 




BVH-11M 


8 : 0 


>14 


8 : 0 


>14 


2° 


None 


0 : 8 


1 


0 : 8 


4 




NEWS 


8 : 0 


>14 


8 : 0 


>14 




NEW8 


0 : 8 


1.5 


0 : 8 


5.5 




NEW9 


3 : 5 


3.5 


2 : 6 


7 




BVH-11M 


8 : 0 


>14 


8 : 0 


>14 


3° 


None 


0 : 8 


1 


0 : 8 


4 




NEW6 


0 : 8 


1 


4 : 4 


10. 5 e 




NEW10 


8 : 0 


>14 


8:0 


>14 




NEW11 


0 : 8 


1.5 


1 : 7 


6 




BVH-11M 


8 : 0 


>14 


8 : 0 


>14 


4 b 


None 


0 : 8 


2 


0 : 8 


4 




BVH-11B 


7 : 1 


>14 


8 : 0 


>14 




NEW14 


8 : 0 


>14 


8 : 0 


>14 


5 


His-Thio 
BVH-3 AD 
BVH-3B 


0 : 8 

1 : 7 
5 : 3 


2 

2.5 
>14 






6 


His-Thio 
BVH-3C 


0 : 8 
0 : 8 


1 
1 







The number of mice alive : the number of mice dead on day 



14 post -challenge. 
b The WU2 challenge dose was 10 5 CFU. 

c Mice living longer than 14 days were assigned a survival 
time of 14 days for the determination of median values. 
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EXAMPLE 12 



This example described the cloning and expression of a 
chimeric gene encoding for a chimeric polypeptide 
corresponding to the carboxy- terminal region of BVH-3 in 
fusion at the C end to the carboxy- terminal region of BVH- 
11 and the additive protection observed after vaccination 
with a chimeric polypeptide. 

It is clear from the studies described above that BVH-3 and 
BVH-11 are serologically distinct molecules simultaneously 
present on pneumoniae . The results of immunological 
studies of mice indicate that both proteins are good 
vaccine candidates. These proteins have the potential to 
provide protection against all pneumococci, regardless of 
serotype. Even though the two proteins share epitopes and 
sequences, they have different characteristics and may 
serve different biological functions. Thus, immunization 
against the two proteins may provide a higher level of 
protection than that imparted by each individually. To 
examine this, several avenues where full-length or 
truncated BVH-3 and BVH-11 are administered in combination 
or in conjugation can be explored. Here we describe the 
genetic engineering of a BVH-3 -BVH-11 fusion gene and 
protein, termed NEW12 (SEQ ID NO: 76 and SEQ ID NO: 58, 
respectively) , and the potential use of NEW12 protein as a 
vaccine. 



fiVH" 3 and BVH-11 gene fragments corresponding to the 3' end 
of the genes were amplified by PCR using pairs of 
oligonucleotides engineered to amplify fragments spanning 
nucleotides 1414 to 3117 (SEQ ID NOi 1) and nucleotides 1060 
to 2520 (SEQ ID NO: 3) from pneumoniae strain SP64 BVH-3 
and BVH-11 genes, respectively. The primers used, HAMJ278 
and HAMJ279; HAMJ282 and HAMJ283 had a restriction 
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10 



15 



20 



5 



endonuclease site at the 5' end, thereby allowing 
directional in-frame cloning of the amplified product into 
the digested pET21b(+) plasmid vector (Table 2). PCR- 
amplified products were digested with restriction 
endonucleases and ligated to linearized plasmid pET21b(+) 
vector digested likewise. The resultant plasmid constructs 
were confirmed by nucleotide sequence analysis. The 
recombinant pET21b(+) plasmid containing the Ndel-Hindlli 
BVH-3 PCR product was linearized by digestion with the 
restriction enzymes Hindlll and NotI for the in- frame 
cloning of the Hindlll-NotI DNA fragment obtained from the 
recombinant pET21(+) vector containing the BVH-11 gene 
fragment. Clones were first stabilized in coli DH5a 
before introduction into L coli BL21(XDE3) for expression 
of a chimeric pneumococcal protein molecule. The 
recombinant chimeric polypeptide, termed NEW 12, was 
expressed as C-terminal fusion with an His- tag. The 
expressed recombinant NEW 12 protein was purified from 
supernatant fractions obtained from centrifugation of 
sonicated IPTG-induced E^ coli cultures using a His-Bind 
metal chelation resin (QIAgen, Chatsworth, CA) . 

According to the same procedure described above, it is 
possible to construct other chimeric polypeptides, as a 
result of a simultaneous expression of New 1 and New 4, New 
1 and New 5, New 1 and New 10, or New 1 and New 14. The 
construction can be with New 1 upstream or downstream of 
New 4, New 5, New 10, BVH-11B or New 14. It is also 
possible to construct other chimeric polypeptides as a 
result of a simultaneous expression of more, than two 
fragments of either genes of BVH-3 , BVH-11 or BVH-11-2. 

Groups of 8 female BALB/c mice (Charles River) were 
immunized subcutaneously two times at three-week intervals 
with 25 ug of either affinity purified His»Tag- fusion NEW1, 
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BVH-11B or NEW12 protein in presence of 15 (xg of QuilA 
adjuvant. Ten to 14 days following the last immunization, 
the mice were challenged with virulent pneumoniae . As 
demonstrated before, NEW1 and BVH-11B molecules comprising 
amino acids 472 to 1039 from BVH-3 protein and amino acids 
354-840 from BVH-11 protein, respectively, correspond to 
portions of the proteins capable of eliciting a protective 
immune response. To determine if a chimeric polypeptide 
would significantly improve the protection compared with 
those seen for the individual counterparts, the challenge 
dose was adjusted in a manner that protection was not 
expected with NEW1 and BVH-11B molecules. Interestingly, 
the chimeric NEW12 protein, elicited protection against the 
mouse-virulent strains WU2 and P4241. Seven out of 8 mice 
immunized with NEW12 were still alive 10 days after the 
challenge while 28 out of 32 mice immunized with NEW1, BVH- 
11B, BVH-3M or adjuvant alone were dead by five days post- 
challenge. Thus, vaccination of mice with NEW12 provided 
the highest degree of protection against WU2 challenge. 
These results indicate that immunization with a chimeric 
polypeptide and possibly a combination of BVH-3 and BVH-11 
gene products can provide additional protection to that 
obtained by administration of BVH-3 or BVH-11 antigens 
alone . 



Table 9. Evaluation of protection elicited by vaccination 
of mice with the chimeric NEW12 molecule 





Challenge with WU2 


Challenge with P4241 


Immunogen 


Alive : dead 3 


Median day 


Alive : dead 


Median day 






alive 




alive 


None 


0:8 


1 


0 : 8 


5 


NEW1' 


2 : 6 


2 


1 : 7 


8 


BVH-11B 


1 : 7 


3.5 


8 : 0 


>14 


NEW12 


6:2 


>14 


7 : 1 


>14 


BVH-3M 


1:7 


3 




>14 
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EXAMPLE 13 



5 



10 



15 



20 



25 



This example illustrates the identification of additional 
BVH^l and ffiffl-u related sequences in Streptococcus species 
other than £L_ pneumoniae . 

It was previously shown that BVH-3, BVH-11 and BVH-11-2 are 
a family of related proteins sharing common sequences. 
Homology searches were performed with the nucleotide 
sequence from the conserved region of these genes and 
compared with GenBank and EMBL sequences using FASTA. The 
most significant homology was observed with a 2.469-kb gene 
coding for a calculated 92-kDa protein (SEQ XD NO: 81) of 
unknown function in agalactia also called group B 
streptococcus or GBS. The gene was designated BVH-71. A 
protein demonstrating 99.2% identity and 99.5% similarity 
with that of GBS was also identified in ^ P vog enas also 
called group A streptococcus or GAS (SEQ ID NO: 83). The 5' 
region of the BVH-71 sequences (SEQ id NO: 80 and SEQ ID 
NO: 82), spanning nucleotides 1 to 717, demonstrated 58 and 
60% identity with the conserved regions of BVH-3 
(nucleotides 1 to 675) and BVH-11 (nucleotides 1 to 684) 
genes respectively. The first 239 amino acids of the 
translated sequences of the GBS and GAS BVH-71 open reading 
frames are 51 and 54% identical to the first 225 and 228 
amino acids of BVH-3 and BVH-11, respectively. In addition ^ 
to structural similarities, streptococcal BVH-3, BVH-11 and 
BVH-71 proteins also share antigenic epitopes. A 97-kDa 
band Was revealed on Western blots of GAS or GBS whole 
cells, using Mab Hll-l.i-Gll reactive with the BVH-3 and • 
BVH-11 conserved regions. Similarly, GAS and GBS 
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recombinant BVH-71 proteins were detected in Western 
immunoblot analysis. 

These results indicate that BVH-71, BVH-3 and BVH-11 
proteins might share similar functions. Our results also 
suggest that BVH-71 proteins can be used as protein vaccine 
components of anti -streptococcus . In a further embodiment 
BVH-71 proteins can be used as protein vaccine components 
of anti-GAS or anti-GBS vaccines. 
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What is claimed is: 

1. An isolated polynucleotide encoding a polypeptide having 
at least 70% identity to a second polypeptide having a 
sequence chosen from: SEQ id NOs: 2, 4, 6, 8, 10, 14, 16, 
55 to 75, 77 to 79, 81, 83 or fragments, analogs or 
derivatives thereof. 

2. A polynucleotide according to claim 1, wherein said 
polynucleotide encodes a polypeptide having at least 95% 
identity to the second polypeptide. 

3. An isolated polynucleotide encoding a polypeptide capable 
of generating antibodies having binding specificity for a 
polypeptide having a sequence chosen from: SEQ ID NOs: 2, 
4, 6, 8, 10, 14, 16, 55 to 75, 77 to 79, 81, 83 or 
fragments, analogs or derivatives thereof. 

4 . An isolated polynucleotide that is complementary to the 
polynucleotide of claim 1. 



An isolated polynucleotide that is complementary to the 
polynucleotide of claim 3. 

The polynucleotide of claim 1, wherein said 
polynucleotide is DNA. 

The polynucleotide of claim 3, wherein said 
polynucleotide is DNA. 

The polynucleotide of claim 1, wherein said 
polynucleotide is RNA. 

The polynucleotide of claim 3, wherein said 
polynucleotide . is RNA. 
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10. A vector comprising the polynucleotide of claim 1, 
wherein said DNA is operably linked to an expression 
control region. 

11. A vector comprising the polynucleotide of claim 3, 
wherein said DNA is operably linked to an expression 
control region; 

12. A host cell transfected with the vector of claim 10. 

13. A host cell transfected with the vector of claim 11. 

14. A process for producing a polypeptide comprising 
culturing a host cell according to claim 12 under 
conditions suitable for expression of said polypeptide. 

15. A process for producing a polypeptide comprising 
culturing a host cell according to claim 13 under 
condition suitable for expression of said polypeptide. 

16. An isolated polypeptide having at least 70% identity to a 
second polypeptide having an amino acid sequence chosen 
from: SEQ ID NOs: 2, 4, 6, 8, 10, 14, 16, 55 to 75, 77 
to 79, 81, 83 or fragments, analogs or derivatives 
thereof . 



17, 



18, 



An isolated polypeptide capable of generating antibodies 
having binding specificity for a second polypeptide 
having a sequence chosen from: SEQ ID NOs: 2, 4, 6, 8, 
10, 14, 16, 55 to 75, 77 to 79, 81, 83 or fragments, 
analogs or derivatives thereof. 

An isolated polypeptide having an amino acid sequence 
chosen from SEQ ID NOs: 2, 4, 6, 8, 10, 14, 16, 55 to 75, 
77 to 79, 81, 83 or fragments, analogs or derivatives 
thereof. 
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19. An isolated polypeptide according to claim 18, wherein the 
N- terminal Met residue is deleted. 



20 



An isolated polypeptide according to claim 18, wherein the 
secretory amino acid sequence is deleted. 



21. A chimeric polypeptide comprising two or more 
polypeptides chosen from SEQ id NOs: 2, 4, 6, 8, 10, 14, 
16, 55 to 75, 77 to 79, 81, 83 or fragments, analogs or' 
derivatives thereof; provided that the polypeptides or 
fragments, analogs or derivatives thereof are linked as 
to form a chimeric polypeptide. 

22 . A chimeric polypeptide comprising two or more polypeptides 

chosen from SEQ ID NOs :10, 58, 60, 62, 64, 67, 68, 69, 

72, 74, 77 or fragments, analogs or derivatives thereof; 
provided that the polypeptides or fragments, analogs or' 
derivatives thereof are linked as to form a chimeric 
polypeptide. 

23. A chimeric polypeptide of formula (I): 
A-(B).-(C) B -D (I) 

Wherein; 

m is 0 or 1, 

n is 0 or 1, 

A is chosen from SEQ ID NOs: 2, 4, 6, 8, 10, 14, 16, 55 
to 75, 77 to 79, 81, 83 or fragments, analogs or 
derivatives thereof; 

B is chosen from SEQ ID NOs: 2, 4, 6, 8, 10, 14, 16, 55 
to 73, 77 to 79, 81, 83 or fragments, analogs or 
derivatives thereof; 

C is chosen from SEQ ID NOs: 2, 4, 6, 8, 10, 14, 16, 55 
to 75, 77 to 79, 81, 83 or fragments, analogs or 
derivatives thereof; and 

D is chosen from SEQ ID NOs: 2, 4, 6, 8, 10, 14, 16, 55 
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to 75, 77 to 79, 81, 83 or fragments, analogs or 
derivatives thereof. 

A chimeric polypeptide of formula (I) : 
A -(B).-(C) n -D (I) 
Wherein ; 
m is 0 or 1, 
n is 0 or 1, 

A is chosen from SEQ ID NOs iIO, 58, 60, 62, 64, 67, 68, 

69, 72, 74, 77 or fragments, analogs or derivatives 
thereof; 

B is chosen from SEQ ID NOs xlO, 58, 60, 62, 64, 67, 68, 

69, 72, 74, 77, or fragments, analogs or derivatives 
thereof; 

C is chosen from SEQ ID NOs :10, 58, 60, 62, 64, 67, 68, 

69, 72, 74, 77 or fragments, analogs or derivatives 
thereof; and 

D is chosen from SEQ ID NOs :10, 58, 60, 62, 64, 67, 68, 

69, 72, 74, 77 or fragments, analogs or derivatives 
thereof . 

A vaccine composition comprising a polypeptide according 
to any one of claims 16 to 24 and a pharmaceutically 
acceptable carrier, diluent or adjuvant. 

26. A method for therapeutic or prophylactic treatment of 
meningitis, otitis media, bacteremia or pneumonia 
infection in an individual susceptible to meningitis, 
otitis media, bacteremia or pneumonia infection comprising 
administering to said individual a therapeutic or 
prophylactic amount of a composition according to claim 
25. 

27. A method for therapeutic or prophylactic treatment of 
streptococcal bacterial infection in an individual 



25 
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susceptible to streptococcal infection comprising 
administering to said individual a therapeutic or 
prophylactic amount of a composition according to claim 
25. 



A method according to claim 26, wherein said individual is 
a mammal. 

A method according to claim 27, wherein said individual is 
a human . 

A method according to claim 22, wherein said bacterial 
infection is S. pneumoniae, group A streptococcus 
(pyogenes) , group B streptococcus (GBS or agalactiae) , 
dysgalactiae, uberis, nocardia or Staphylococcus aureus. 

A method according to claim 26, wherein said bacterial 
infection is S.pneumoni^ g. 

Use of a vaccine composition according to claim 25 for 
the prophylactic or therapeutic treatment of 
Streptococcal infection in an animal susceptible to or 
infected with streptococcal infection comprising 
administering to said animal a prophylactic or 
therapeutic amount of the composition. 
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ATGAAATTTA 
TGTGCCTATG 
TATGTGGATG 
CAGAAAGAAG 
ACGTCACACG 
AGTGAAGAAC 
GAAGTCAAGG 
GCAGCTCATG 
GTCAAAGATA 
ACGACAAATG 
TATATCGTTC 
GAATTAGCAG 
TATTCTTCAA 
CCAGCAAATA 
GCCCAACGTT 
ACACCAAATG 
CTTTCTGCTT 
GTTTCTACAA 
CCTTCTTCTT 
CCAAAAGATA 
CATTACATTC 
ACACCTTCTC 
GGATACGGAT 
CACGGAGACC 
GCGCAAAAAC 
CATGAACAGG 
GAAAAAATTG 
AAAGAAAAAA 
GAACATAAAC 
GAAGGAGTTG 
GTTAATTTGT 
AAACGCGTTT 
GTAAAATTAA 
GAAGGAGTAG 
TTTAAGTATA 
GTTCCAACCT 
GGGGATACTT 
GTCAGAGTGT 
GAAATCAAAT 
ATTCCTGTAA 
GTACCTATCT 
AGGAATAAAG 
GAGAAGGTAG 
GAAGAAGTTC 
GGGATGAAGC 
TCAGGAGAAG 
GGTGAAAATA 
ACAGAAAATA 
GAAAACTCAA 
TTAGATCCA6 
TTTACAGCTA 
GAATTAAGAT 
(SEQ ID NOt 



GTAAAAAATA 
CACTAAACCA 
GCAGCCAGTC 
GAATTCAGGC 
GTGACCACTA 
TCTTGATGAA 
GTGGTTATAT 
CTGATAATGT 
ATGAGAAGGT 
ATGGTTATGT 
CTCATGGAGG 
CAGCTAAAGC 
CAGCTAGTGA 
AATCTGAAAA 
ACAGTGAATC 
GAGTTGCGAT 
TAGAAGAAAA 
ATGCAAAACC 
TAACGACAAG 
TCGTTGAAGA 
CAAAATCAAA 
CATCTCTTCC 
TTGATGCTAA 
ACAATCATTA 
ATTTAGAGGA 
ATTATCCAGG 
CTGGCATTAT 
ATGCGATTAT 
CGGTTGGAAT 
CTAAAAAAGA 
TAAAAAATAG 
CTTTTAGTTT 
TAACACCAGA 
GGAATATTGC 
CTATCGCTTC 
CTTTAGCTTA 
ATTTAAGAGT 
TTGATGAATT 
TACCGATTCC 
CCTTCATGGC 
TGGAAAAAGA 
CACAAGAAAA 
AAAAAGAAAA 
CTACAGTGGA 
TAGAAAATGT 
TCATTAAAAA 
AACCATCTGA 
AACCAGCAGA 
CGGATAATGG 
CATTAGAGGA 
GTTACGGATT 
TGCCAAGTGG 
1) 



TATAGCAGCT GGATCAGCTG 
GCATCGTTCG CAGGAAAATA 



AAGTCAGAAA AGTGAAAACT 
TGAGCAAATT GTAATCAAAA 
TCATTACTAT AATGGGAAAG 
GGATCCAAAC TATCAACTTA 
CATCAAGGTC GATGGAAAAT 
TCGAACTAAA GATGAAATCA 
TAACTCTAAT GTTGCTGTAG 
CTTTAATCCA GCTGATATTA 
TCACTATCAC TACATTCCCA 
ACATCTGGCT GGAAAAAATA 
CAATAACACG CAATCTGTAG 
TCTCCAGAGT CTTTTGAAGG 
AGATGGCCTG GTCTTTGACC 
TCCGCATGGC GACCATTACC 
GATTGCCAGA ATGGTGCCTA 
TAATGAAGTA GTGTCTAGTC 
TAAGGAGCTC TCTTCAGCAT 
AACGGCTACA GCTTATATTG 
TCAAATTGGG CAACCGACTC 
AATCAATCCA GGAACTTCAC 
TCGTATTATC GCTGAAGATG 
TTTCTTCAAG AAGGACTTGA 
AGTTAAAACT AGTCATAATG 
TAATGCCAAA GAAATGAAAG 
GAAACAATAT GGTGTCAAAC 
TTATCCGCAT GGAGATCACC 
TGGTCATTCT CACAGTAACT 
AGGGAATAAA GTTTATACTG 
TACGTTTAAT AATCAAAACT 
TCCGCCTGAA TTGGAGAAAA 
TGGAAAAGTA TTGGAGAAAG 
AAACTTTGAA TTAGATCAAC 
AAAAGATTAT CCAGAAGTAA 
CAAAATGGCC AGTCAAACGA 
GAACCCTCAA TTTGCAGTGC 
TCATGGAAAT GCTTATTTAG 
GAAATTAAAC CAAGGAACAA 
AAATGCTTAT TTGGACAATC 
AAATCAAACT GATAAACCAA 
C TCAA AACTT GATGAAAAGG 
ACTTTCTGAA ACTGGGAATA 
TCCTGTACAA GAAAAAGTAG 
CTTGTTTAAT ATGGACGGAA 
GAATATGGCA GATTTTACAG 
AAATGGAAAA GTATCTACTG 
TTCTTTACCA GAGGCACCAA 
AATGTTGAAT CCAGAAGGGA 
AGCTCCAGCA GTAGATCCTG 
AGGCTTAGAT AGTGTTATAT 
AGAAGTGATA AAAAAGAATT 
FIGURE 1 



TTATCGTATC 
AGGACAATAA 
TGACACCAGA 
TTACAGATCA 
TTCCTTATGA 
AAGACGCTGA 
ATTATGTCTA 
ATCGTCAAAA 
CAAGGTCTCA 
TCGAAGATAC 
AAAGCGATTT 
TGCAACCGAG 
CAAAAGGATC 
AACTCTATGA 
CTGCTAAGAT 
ACTTTATTCC 
TCAGTGGAAC 
TAGGCAGTCT 
CTGATGGTTA 
TAAGACATGG 
TTCCAAACAA 
ATGAGAAACA 
AATCAGGTTT 
CAGAAGAGCA 
GATTAGATTC 
ATTTAGATAA 
GTGAAAGTAT 
ATCATGCAGA 
ATGAACTGTT 
GAGAAGAATT 
TTACTCTAGC 
AATTAGGTAT 
TATCTGGTAA 
CTTATTTACC 
GTTATGATGG 
TTTTCTATCC 
CTAAAGGAAC 
AAAATAACTA 
CCAGAACGGC 
AATCGACTTA 
GTATTCTACC 
TAGAAGAACC 
GTACTAGTAA 
CAAAATTTGC 
CAATTGAATT 
GAGAAGCACC 
GAACAGTTGA 
ACGAAAAACC 
ATGTGGGGAG 
TAGAAGAAAA 
TCAATATGGA 
TATCTGATTT 



CTTGAGTCTA 
TCGTGTCTCT 
CCAGGTTAGC 
GGGCTATGTA 
TGCCCTCTTT 
TATTGTCAAT 
CCTGAAAGAT 
ACAAGAACAT 
GGGACGATAT 
GGGTAATGCT 
ATCTGCTAGT 
TCAGTTAAGC 
AACTAGCAAG 
TTCACCTAGC 
TATCAGTCGT 
TTACAGCAAG 
TGGTTCTACA 
TTCAAGCAAT 
TATTTTTAAT 
TGATCATTTC 
TAGTCTAGCA 
TGAAGAAGAT 
TGTCATGAGT 
AATTAAGGCT 
TTTGTCATCT 
AAAAATCGAA 
TGTCGTGAAT 
TCCGATTGAT 
TAAACCCGAA 
AACGAATGTT 
CAATGGTCAA 
CAATATGCTA 
AGTATTTGGA 
AGGACAAACA 
TACATTTACA 
TTTCCATGCA 
TGATGCTTTA 
TAAAGTTGGT 
CGGAAATAAA 
TATTGTGGAA 
ACAATTTAAA 
AAAGACTAGT 
TTCAACGTTA 
TGAAAGTTAT 
ATATTTACCA 
TCAAGGAAAT 
GAACCAACCA 
TGTAAAACCA 
TGACCCTATG 
ATTAGAAAAA 
TGGAACGATT 
CATAGCGTAA 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
3120 
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MKFSKKYIAA GSAVIVSLSL CAYALNQHRS QENKDNNRVS YVDGSQSSQK 50 

SENLTPDQVS QKEGIQAEQI VIKITDQGYV TSHGDHYHYY NGKVPYDALF 100 

SEELLMKDPW YQLKDADIVN EVKGGYIIKV DGKYYVYLKD AAHADNVRTK 150 

DEINRQKQEH VKDNEKVNSN VAVARSQGRY TTNDGYVFNP ADIIEDTGNA 200 

YIVPHGGHYH YIPKSDLSAS ELAAAKAHLA GKNMQPSQLS YSSTASDNNT 250 

QSVAKGSTSK PANKSENLQS LLKELYDSPS AQRYSESDGL VFDPAKIISR 300 

TPNGVAIPHG DHYHFIPYSK LSALEEKIAR MVPISGTGST VSTNAKPNEV 350 

VSSLGSLSSN PSSLTTSKEL SSASDGYIFN PKDIVEETAT AYIVRHGDHF 4 00 

HYIPKSNQIG QPTLPNNSLA TPSPSLPINP GTSHEKHEED GYGFDANRII 450 

AEDESGFVMS HGDHNHYFFK KDLTEEQIKA AQKHLEEVKT SHNGLDSLSS 500 

HEQDYPGNAK EMKDLDKKIE EKIAGIMKQY GVKRESIWN KEKNAIIYPH 550 

GDHHHADPID EHKPVGIGHS HSNYELFKPE EGVAKKEGNK VYTGEELTNV 600 

VNLLKNSTFN NQNFTLANGQ KRVSFSFPPE LEKKLGINML VKLITPDGKV 650 

LEKVSGKVFG EGVGNIANFE LDQPYLPGQT FKYTIASKDY PEVSYDGTFT 700 

VPTSLAYKMA SQTIFYPFHA GDTYLRVNPQ FAVPKGTDAL VRVFDEFHGN 750 

AYLENNYKVG EIKLPIPKLN QGTTRTAGNK IPVTFMANAY LDNQSTYIVE 800 

VPILEKENQT DKPS ILPQFK RNKAQENSKL DEKVEEPKTS EKVEKEKLSE 850 

TGNSTSNSTL EEVPTVDPVQ EKVAKFAESY GMKLENVLFN MDGTIELYLP 900 

SGEVIKKNMA DPTGEAPQGN GENKPSENGK VSTGTVENQP TENKPADSLP 950 

EAPNEKPVKP ENSTDNGMLN PEGNVGSDPM LDPALEEAPA VDPVQEKLEK 1000 

FTASYGLGLD SVIFNMDGTI ELRLPSGEVI KKNLSDFIA (SEQ ID NO: 2) 1039 

FIGURE 2 
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ATGAAAATCA 
GCTTATQAAC 
ATAGATGGAA 
CGTGAAGGAA 
TCTCATGGAG 
GAAGAGCTCC 
ATCAAGGGTG 
GCTCATGCGG 
CAGCATCGTG 
GOACGCTACA 
GGCGATGCCT 
TCAGCTAGCG 
TTAAGAACCT 
GTAAGCAATC 
GCAAGTCAAA 
CAACGCCATG 
GCCAGAGGTG 
TCTGAATTGG 
GTACCAGATT 
CCGCAACCTG 
AAAGAAGCTG 
TATATCCCAG 
AAGCAGGAAA 
CGAGAATTTT 
AATAAAGGTC 
GTCTCAAGTG 
CATCCAGAAC 
GTAGCCAAGT 
ATAACCAGTG 
AAAAAAGATA 
GGTTTGACCC 
GAAGCTATCT 
AATCTTCAAT 
TACCATAACA 
ACTCTTGAGG 
CATTCAGATA 
GCTGATACCA 
GAAGAAACCC 
GAGGAACCAG 
GTTGAAGAAA 
AAGTCCAATG 
GACAACAATA 
TAA (SEQ 



ATAAAAAATA 
TAGGTTTGCA 
AACAAGCGAC 
TCAACGCCGA 
ACCATTATCA 
TCATGAAAGA 
GTTATGTCAT 
ATAATGTCCG 
AAGGAGGGAC 
CCACAGATGA 
ATATCGTTCC 
AGTTGGCTGC 
ATCGCCGACA 
CAGGAACTAC 
GTAATGACAT 
TAGAATCTGA 
TAGCTGTCCC 
AAAAACGAAT 
CAAGACCAGA 
CACCAAATCC 
TTCGAAAAGT 
CCAAGAATCT 
GTTTATCTCA 
ACAATAAGGC 
GACAAGTTGA 
ATAAAGTCAA 
GTTTAGGAAA 
TGGCAGGCAA 
ATGAGGGGGA 
GTTTGTCTGA 
CTCCTTCGAC 
ACAACCGCGT 
ATACTGTAGA 
TCAAATTTGA 
ATCTTTTGGC 



ATGGTTTTGG 
ATCAAACGGA 
CTCGAGAAGA 
AAGAAGAATC 
AACTGAGAGA 
CCAAAGAGAC 
CTATTATGGC 
ID NO: 3) 



TCTAGCTGGG 
TCAAGCTCAA 
GCAAAAAACG 
ACAAATCGTC 
TTACTATAAT 
TCCGAATTAT 
TAAGGTAAAC 
TACAAAAGAA 
TTCAGCAAAC 
TGGTTATATC 
TCATGGAGAT 
TGCAGAAGCC 
AAATAGCGAT 
AAATACTAAC 
TGATAGTCTC 
TGGCCTTATT 
TCATGGTAAC 
TGCTCGTATT 
AGAACCAAGT 
TCAACCAGCT 
AGGCGATGGT 
TTCAGCAGAA 
TAAGCTAGGA 
TTATGACTTA 
TTTTGAGGCT 
GTTAGTGGAT 
ACCAAATGCG 
GTACACAACA 
TGCCTATGTA 
AGCTGAGAGA 
AGACCATCAG 
GAAAGCAGCT 
AGTCAAAAAC 
GTGGTTTGAC 
GACTGTCAAG 
TAACGCTAGC 
AAAACCAAGC 
GAAACCACAA 
ACGAGAGGAA 
GGCTGAAGAT 
TCTCACAGGA 
AGAAGCTGAA 



TCAGTAGCTA 
ACTGTAAAAG 
GAGAATTTGA 
ATCAAGATTA 
GGCAAGGTCC 
CAGTTGAAGG 
GGTAAATACT 
GAAATCAATC 
GATGGTGCGG 
TTCAATGCAT 
CATTACCATT 
TTCCTATCTG 
AACACTCCAA 
ACAAGCAACA 
TTGAAACAGC 
TTCGACCCAG 
CATTACCACT 
ATTCCCCTTC 
CCACAACCGA 
CCAAGCAATC 
TATGTCTTTG 
ACAGCAGCAG 
GCTAAGAAAA 
CTAGCAAGAA 
TTGGATAACC 
GATATTCTTG 
CAAATTACCT 
GAAGACGGTT 
ACTCCACATA 
GCGGCAGCCC 
GATTCAGGAA 
AAGAAGGTGC 
GGTAGTTTAA 
GAAGGCCTTT 
TACTATGTCG 
GACCATGTTC 
GAGGAGAAAC 
AGCGAGAAAC 
TCAGAAGAAC 
TTACTTGGAA 
TTAAAAAATA 
AAACTATTGG 



CACTTGTTTT 
AAAATAATCG 
CTCCTGATGA 
CGGATCAAGG 
CTTATGATGC 
ATTCAGACAT 
ATGTTTACCT 
GGCAAAAACA 
TAGCCTTTGC 
CTGATATCAT 
ACATTCCTAA 
GTCGGGAAAA 
GAACAAACTG 
ACAGCAACAC 
TCTACAAACT 
CGCAAATCAC 
TTATCCCTTA 
GTTATCGTTC 
CTCCAGAACC 
CAATTGATGA 
AGGAGAATGG 
GCATTGATAG 
CTGACCTCCC 
TTCACCAAGA 
TGTTGGAACG 
CCTTCTTAGC 
ACACTGATGA 
ATATCTTTGA 
TGACCCATAG 
AGGCTTATGC 
ATACTGAGGC 
CACTTGATCG 
TCATACCTCA 
ATGAGGCACC 
AACATCCAAA 
AAAGAAACAA 
CTCAGACAGA 
CAGAGTCTCC 
CTCAGGTCGA 
AAATCCAGGA 
ATTTACTATT 
CTTTATTAAA 



AAGTGTCTGT 
TGTTTCCTAT 
GGTTAGCAAG 
TTATGTGACC 
CATCATCAGT 
TGTCAATGAA 
TAAGGATGCA 
AGAACATAGT 
ACGTTCACAG 
CGAAGATACG 
GAATGAGTTA 
TCTGTCAAAT 
GGTACCTTCT 
TAACAGTCAA 
GCCTTTGAGT 
AAGTCGAACC 
TGAACAAATG 
AAACCATTGG 
TAGTCCAAGT 
GAAATTGGTC 
AGTTTCTCGT 
CAAACTGGCC 
ATCTAGTGAT 
TTTACTTGAT 
ACTCAAGGAT 
TCCGATTCGT 
TGAGATTCAA 
TCCTCGTGAT 
CCACTGGATT 
TAAAGAGAAA 
AAAAGGAGCA 
TATGCCTTAC 
TTATGACCAT 
TAAGGGGTAT 
CGAACGTCCG 
AAATGGTCAA 
AAAACCTGAG 
AAAACCAACA 
GACTGAAAAG 
TCCAATTATC 
TGGCACCCAG 
GGAGAGTAAG 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2523 
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MKINKKYLAG SVATLVLSVC AYELGLHQAQ 
ENLTPDEVSK REQINAEQIV IKITDQGYVT 
EELLMKDPNY QLKDSDIVNE IKGGYVIKVN 
EINRQKQEHS QHREGGTSAN DGAVAFARSQ 
GDAYIVPHGD HYHYIPKNEL SASELAAAEA 
NTPRTNWVPS VSNPGTTNTN TSNNSNTNSQ 
QRHVESDGLI FDPAQITSRT ARGVAVPHGN 
IPLRYRSNHW VPDSRPEEPS PQPTPEPSPS 
KEAVRKVGDG YVFEENGVSR YIPAKNLSAE 
AKKTDLPSSD REFYNKAYDL LARIHQDLLD 
VSSDKVKLVD DILAFLAPIR HPERLGKPNA 
EDGYIFDPRD ITSDEGDAYV TPHMTHSHWI 
GLTPPSTDHQ DSGNTEAKGA EAIYNRVKAA 
GSLIIPHYDH YHNIKFEWFD EGLYEAPKGY 
HSDNGFGNAS DHVQRNKNGQ ADTNQTEKPS 
SEKPESPKPT EEPEEESPEE SEEPQVETEK 
KSNAKETLTG LKNNLLFGTQ DNNTIMAEAE 



TVKENNRVSY IDGKQATQKT 50 

SHGDHYHYYN GKVPYDAIIS 100 

GKYYVYLKDA AHADNVRTKE 150 

GRYTTDDGYI FNASDIIEDT 200 

FLSGRENLSN LRTYRRQNSD 250 

ASQSNDIDSL LKQLYKLPLS 300 

HYHFIPYEQM SELEKRIARI 350 

PQPAPNPQPA PSNPIDEKLV 400 

TAAGIDSKLA KQESLSHKLG 450 

NKGRQVDFEA LDNLLERLKD 500 

QITYTDDEIQ VAKLAGKYTT 550 

KKDSLSEAER AAAQAYAKEK 600 

KKVPLDRMPY NLQYTVEVKN 650 

TLEDLLATVK YYVEHPNERP 700 

EEKPQTEKPE EETPREEKPQ 750 

VEEKLREAED LLGKIQDPII 800 
KLLALLKESK (3EQ ID NOi 4) 840 

FIGURE 4 



ATGGAGAATA TAGACATGTT TAAATCAAAT CATGAGCGAA GAATGCGTTA TTCCATTCGT 60 

AAATTTAGTG TAGGAGTAGC TAGCGTAGCT GTTGCCAGTC TTTTTATGGG AAGTGTTGTA 120 

CATGCGACAG AGAAAGAGGG AAGTACCCAA GCAGCCACTT CTTTTAATAG GGGAAATGGA 180 

AGTCAGGCAG AACAACGTGG AGAACTCGAT TTAGAACGAG ATAAGGCAAT GAAAGCGGTC 240 

AGTGAATATG TAGGAAAAAT GGTGAGAGAT GCCTATGTAA AATCAGATAG AAAACGACAT 300 

AAAAATACTG TAGCTCTAGT TAACCAGTTG GGAAACATTA AGAACAGGTA TTTGAATGAA 360 

ATAGTTCATT CAACCTCAAA AAGCCAACTA CAGGAACTGA TGATGAAGAG TCAATCAGAA 420 

GTAGATGAAG CTGTGTCTAA ATTTGAAAAG GACTCATTTT CTTCGTCAAG TTCAGGATCC 480 

TCCACTAAAC CAGAAACTCC GCAGCCGGAA AATCCAGAGC ATCAAAAACC AACAACTCCA 540 

TCTCCGGATA CCAAACCAAG CCCTCAACCA GAAGGCAAGA AACCAAGCGT ACCAGACATT 600 

AATCAGGAAA AAGAAAAAGC TAAGCTTGCT GTAGTAACCT ACATGAGCAA GATTTTAGAT 660 

GATATACAAA AACATCATCT GCAGAAAGAA AAACATCGTC AGATTGTTGC TCTTATTAAG 720 

GAGCTTGATG AGCTTAAAAA GCAAGCTCTT TCTGAAATTG ATAATGTAAA TACCAAAGTA 780 

GAAATTGAAA ATACAGTCCA CAAGATATTT GCAGACATGG ATGCAGTTGT GACTAAATTC 840 

AAAAAAGGCT TAACTCAGGA CACACCAAAA GAACCAGGTA ACAAAAAACC ATCTGCTCCA 900 

AAACCAGGTA TGCAACCAAG TCCTCAACCA GAGGTTAAAC CGCAGCTGGA AAAACCAAAA 960 

CCAGAGGTTA AACCGCAACC AGAAAAACCA AAACCAGAGG TTAAACCGCA GCCGGAAAAA 1020 

CCAAAACCAG AGGTTAAACC GCAGCCGGAA AAACCAAAAC CAGAGGTTAA ACCGCAGCCG 1080 

GAAAAACCAA AACCAGAGGT TAAACCGCAG CCGGAAAAAC CAAAACCAGA GGTTAAACCG 1140 

CAGCCGGAAA AACCAAAACC AGAGGTTAAA CCGCAGCCGG AAAAACCAAA ACCAGAGGTT 1200 

AAACCGCAGC CGGAAAAACC AAAACCAGAG GTTAAACCGC AGCCGGAAAA ACCAAAACCA 1260 

GAGGTTAAAC CGCAGCCGGA AAAACCAAAA CCAGAGGTTA AACCGCAACC AGAAAAACCA 1320 

AAACCAGAGG TTAAACCGCA ACCAGAAAAA CCAAAACCAG ATAATAGCAA GCCACAAGCA 1380 

GATGATAAGA AGCCATCAAC TACAAATAAT TTAAGCAAGG ACAAGCAACC TTCTAACCAA 1440 

GCTTCAACAA ACGAAAAAGC AACAAATAAA CCGAAGAAGT CATTGCCATC AACTGGATCT 1500 

ATTTCAAATC TAGCACTTGA AATTGCAGGT CTTCTTACCT TGGCGGGGGC AACCATTCTT 1560 

GCTAAGAA^A GAATGAAATA G (SKQ ID NO: 5) 1581 

FIGURE 5 



4/33 



WO 00/39299 



PCT/CA99/0I218 



HBRRMRYSIR KFSVGVASVA VASLFMGSW HATEKEGSTQ 
AATSFNRGNG SQAEQRGELD LERDKAMKAV SEYVGKMVRD AYVKSDRKRH 
£2I25 L GMIKN *™* IVHSTSKSQL QELMmScS S J 
2™ S0S STKPETP Q PB NPEHQKPTTP SPDTKPSPQP EGKKPSVPDI 
NQE^KAKLA WTYMSKILD DIQKHHLQKE KHRQIVALIK ELDELKKqS 
SEIDNVNTKV EIENTVHKIF ADMDAWTKF KKGLTQDTPK EPGNkSsM 
KPGMQPSPQP EVKPQLEKPK PEVKPQPEKP KPEVKPQPEK PKPBVKK»£ 
KPKPEVKPQP EKPKPEVKPQ PEKPKPEVKP QPeSvk SS£££Sl 
KPQPEKPKPE VKPQPEKPKP EVKPQPEKPK PEVKPQPEKP SevSSIk 4 5 n 
SSSZSZ f? 1 ^ 3 ™ 1 ^KDKQPSNQ ASTNEKATNK S 00 
ISNLALEIAG LLTLAGATIL AKKRMK (SEQ ID NO: 6) fJJ 



50 
100 
150 
200 
250 
300 
3S0 
400 
450 
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ATGAAATTTA GTAAAAAATA TATAGCAGCT GGATCAGCTG TTATCGTATC fTW»^, 

■HHW i 

■■■■■■ I 

TAACGACAAG TAAGGAGCTC TCTTCAGCAT CTGATGGTTA TATTTTTAAT !!! 

CGCAAAAACA ^ ^^^0^7) ^GAAGAGCA AATTAAGGTG 1440 

1455 



FIGURE 7 
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QSAVIVSLSL CAYALNQHRS QENKDNNRVS YVDGSQSSQK 
!™ 72K QKEGIQAEQI VIKITDQGYV TSHGDHYHYY NGKVPYDALF 
SEELLMKDPN YQLKDADIVN EVKGGYIIKV DGKYYVYLKD AAHADNVRTK 
DEINRQKQEH VKDNEKVNSN VAVARSQGRY TTNDGYVFNP ADIIEDTGNA 

YIPKSDLSAS ELAAAKAHLA GKNMQPSQLS YSSTAsS 
QSVAKGSTSK PANKSENLQS LLKELYDSPP AQRYSESDGL VFDPAKIISR 
TPNGVAIPHG DHYHFIPYSK LSALEEKIAR mSpisGTGST ^SSZSS 

SSoS n^r SKEL SSASDGYI ™ "DIVBnS AY» 
™252- QPTIjPNNSLA TPSPSLPINP GTSHEKHEED GYGFDANRII 
AEDESGFVMS HGDHNHYFFK KDLTEEQIKV RKNI (SEQ ID NoTs) 



50 
100 
150 
200 
250 
300 
350 
400 
450 
484 
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ATGAAAGATT TAGATAAAAA AATCGAAGAA AAAATTGCTG GCATTATT a a a n RaTA ^ 

sips—"—" 

SSS£ SSSS Sir™ 8 s 
SEE S ^ =S SSSES 
=££ SSS2 SSSSS = = SSSS? 

====== 

SJSS TCCGCOTA 



(SEQ ID HO i 9) 
FIGURE 9 
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MKDLDKKIEE KIAGIMKQYG 
HKPVGIGHSH SNYELFKPEE 
QNFTLANGQK RVSFSFPPEL 
GVGNIANFEL DQPYLPGQTF 
QTIFYPFHAG DTYLRVNPQF 
IKLPIPKLNQ GTTRTAGNKI 
KPSILPQFKR NKAQENSKLD 
EVPTVDPVQE KVAKFAESYG 
FTGEAPQGNG ENKPSENGKV 
NSTDNGMLNP EGNVGSDPML 
VIFNMDGTIE LRLPSGEVIK 



VKRESIWNK EKNAIIYPHG 
GVAKKEGNKV YTGEELTNW 
EKKLGINMLV KLITPDGKVL 
KYTIASKDYP EVSYDGTFTV 
AVPKGTDALV RVFDEFHGNA 
PVTFMANAYL DNQSTYIVEV 
EKVEEPKTSE KVEKEKLSET 
MKLENVLFNM DGTIELYLPS 
STGTVENQPT ENKPADSLPE 
DPALEEAPAV DPVQEKLEKF 
KNLSDFIA (SEQ ID NOi 

FIGURE 10 



DHHHADPIDE 50 

NLLKNSTFNN 100 

EKVSGKVFGE 150 

PTSLAYKMAS 200 

YLENNYKYGE 250 

PILEKENQTD 300 

GNSTSNSTLE 350 

GEVIKKNMAD 400 

APNEKPVKPE 450 

TASYGLGLDS 500 

10) 528 



i 
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BVH3 WU2 
BVH3 RX1 
BVH3 JNR7/87 
BVH3 SP64 
BVH3 P4241 
BVH3 A66 



CAYALNQHRSQENKDNNRVSYVDGSQSSQKSKNLTPDQVSQKEGIQAEQIVIKITDQGYV 60 

CA YALNQKRSQENKDNNR VS YVDGSQS SQKSENLT PDQVSQKEG I QAEQ I VI K I TOQGYV 60 

CAYALNQHRSQENKDNNRVSYVDGSQSSOKSENLTPDQVSQKEGIQAEQI VIKITDQGYV 6 0 

C A YALNQHRS QENKDNNR VS YVDG S QS SQ KS ENLT PDQVSQKEG I QAEQ I V I K I TDQG YV 60 

CAYALNQHRSQEMKDI^VSYVIXSSQSSQKSENLTPDQVSQKEGIQAEQIVIKITDQGYV 60 

CAYAXNQHRSQEMKDNlTOVSYVIXiSQSSQKSENLTPDQVSQKEGIQAEQI VIKITDQGYV 60 



BVH3 


WU2 


61 


BVH3 


RX1 


61 


BVH3 


JNR7/87 


61 


BVH3 


SP64 


61 


BVH3 


P4241 


61 


BVH3 


A66 


61 


BVH3 


WU2 


121 


BVH3 


RX1 


121 


BVH3 


JNR7/87 


121 


BVH3 


SP64 


121 


BVH3 


P4241 


121 


BVH3 


A66 


121 



TSHGDHYH YYNGKVPYDALF S E ELLMKD PNYQLKDADI VNE VKGG YI I KVDGKYYVYLKD 
TSHGDHYHYYNGKVPYDAiFSEELLMKDPNYQLKDADI VNEVKGGYI I KVDGKYYVYLKD 
TSHGDH YHYYNGKVPYDALFS EELLMKDPNYQLKDAD I WEVKGGYI I KVDGKYYVYLKD 
TSHGDHYHYYNGKVPYDALFSEELLMKDPbrtfQLK^ 

TSHGDHYHYYNGKVPYDALFSEELLMKD PNYQLKDADI VNEVKGGYI I KVDGKYYVYLKD 
TSHGDHYHYYNGKVPYDALPSEELLMKDPNYQLKDADIVNEVKGGYI I KVDGKYYVYLKD 



120 
120 
120 
120 
120 
120 



AAHADNVRTKDEIITOQKQEHVKDNEKVNSNVAVAJlSQGRYTThlDG YVPNPADI I EDTGNA 180 

AJUIADNVRTKDEIKRQKQEHVKDKEKVHSNVAVAIISQGRYTTNDGYVFNPADI I EDTGNA 180 

AAHADNVRTKDEINRQKQEHVKDITCKVN^^ 180 

AAHAIJNVRTKDEINRQKQEHVKDNEKW^^ 180 

AAHADNVRTKDEINRQKQBHVKDNEKVNSNVAVAJ^ " 

ft ft tin TNVTl m w i m nTMKA — • ■«'*****r»w*wv 1SV 

EDTGNA 180 



AAHADNVRTKDEINRQKQEHVKDNEKVNSNVAVARSQGRYTT^ 



BVH3 WU2 
BVH3 RX1 
BVH3 JNR7/87 
BVH3 SP64 
BVH3 P4241 
BVH3 A66 



181 Y1VPHRGHYHYI ^^DLSASEIAAAKAHIiAGKNMQPSQLSYSSTA5DNNTOSVARGSTSK 240 

181 YIVPHGGHYHYIPKSDI*SASEIAAAKAHLJW3KNMQPSQLSYSSTASD \\l 
181 YI VPHGGHYHY I PKSDLSASEIAAAXAHI^KNMQ^^ 
181 YIVPHGGHYHYIPKSDLSASBI^^ 

181 YIVPHRGHYHYIPKSDI^ASEUUU^ \Vt 

YiyPHRGHYHYIPKSDLSASEIAAAKAHI^ \\\ 



181 



BVH3 WU2 
BVH3 RX1 
BVH3 JNR7/87 
BVH3 SP64 
BVH3 P4241 
BVH3 A66 



ttl p^^?^*^ 300 

Itl p^^f^^ 300 

lit pawirc tot^ct't'bSt'w ^Q^^^*****VFD PAKI I S RTPNGVAI PHGDHYHF I P YS K 300 

24^ P ANJfQ ^iT^n c t't'^pt'v^^ ^^^^ ^ ^ ^* L VFD PAKI I S RT PNG VAI PHGDHYH F I P YS K 300 

241 PANKSENLQSLLKELYDSPSAQRYSESDGLVFDPAKIISRTPNGVAIPHGDHYHFIPYSK 300 

241 PANKSI^SI^KELYDSPSAQRYSESIX™ 3^ 



BVH3 WU2 
BVH3 RX1 
BVH3 JNR7/87 
BVH3 SP64 
BVH3 P4241 
BVH3 A66 



™, ^^ EKI7 ^PISGTGSTVSTNAKPNEWSSI^SI^SNPSSLTTSKEL 360 

^A^EKIARRVPISGTGSTVSTNAKPNEWSSI^SI^SNPSSLTTSKEI^SAS^ 360 

T E f^? KI ^PISCm5STVSTNAKPNEVVSS^ 3^ 

l° n ] ^A^EKIAf^ISGTGSTVSTNAKPNEWSSI^SLSSNPSSLTTSKELSSASDGYITO 360 

301 ^ALEEKIARMVPISGTOSWSTNAKPN^ 3^ 



BVH3 WU2 
BVH3 RX1 
BVH3 JNR7/87 
BVH3 SP64 
BVH3 P4241 
BVH3 AS 6 



361 



PraiVEETATAYIVRHGDHFHYIPKSNQIGQPTLPNNSIATPSPSLPINPOTSHEKHEED 



361 *!?^EI ATAYI ™^ 

n™^*™!^ 420 



361 



420 
420 



P^IVEBTATAYIVRHGDHFHYI PKSNQ I GQ PTLPNNS LAT PS PSLP I NPGTSHEKHEBD 
P^IVBETATAYIVRHGDHFHYIPKSNQIGQPTLPNNSLATPSPSLPINPGTSHEKHEED 
P^IVEETATAYIVRHGDHFHYIPKSNQIGQPTLPNNSLATPSPSLPINPGTSHEKHEED 420 



361 
361 



420 



BVH3 WU2 
BVH3 RX1 
BVH3 JNR7/87 
BVH3 SP64 
BVH3 P4241 
BVH3 A66 1 



\l\ ^f^"*?*^ 480 

HI °™FD^IIAED^ 480 

^^ANRIIAI^ESGFVMSra^^ 48O 

421 GYGFDANRIIAEDESGFVMSHGDHNHYFFKKDLTEEQIKAAQKHLE A 0 

III cyS^^ 80 

421 G YGFDANRI I AEBESGFVMSHGDHNHYFFKKDLTEEQI KAAQKHLEEVKTSHNGLDSLSS 480 
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BVH3 WU2 
BVH3 RX1 
BVH3 JNR7/87 
BVH3 SP64 
BVH3 P4241 
BVH3 A66 



481 HEQDYPSNAKBMKDLDKKIEEKIAOIMKQYGVKRKSIVVNKEKNAIIYPHODHHHAbPID 540 
J" !! «?*^^^^ DK ^ I E EK I AG I MKQ YGVKRES I WNKEKNAI I Y PHGDHHHADP I D 540 

til y^SS?™^ lit 

481 HEQDYPGNAKEMKDLDKKI EEKI AG I MKQ YGVKRES I WNKEKNAI I Y PHGDHHHADP ID 
481 HEQDYPSNAKEMKDLDKKIEEKIM 



481 



BVH3 WU2 
BVH3 RX1 
BVH3 JNR7/87 
BVH3 SP64 
BVH3 P4241 
BVH3 A66 



541 



541 5£SJ2^^ 



600 
600 
600 
600 
600 
600 



BVH3 HU2 
BVH3 RX1 
BVH3 JNR7/87 
BVH3 SP64 
BVH3 P4241 
BVH3 A66 



^VSFSFPPE^KKI^INMLVK^TPDGKVI^ 



601 



'PYLPGQT 660 
PYLPGQT 660 
PYLPGQT 660 
PYLPGQT 660 
660 
660 



BVH3 MU2 
BVH3 RX1 
BVH3 JNR7/87 
BVH3 SP64 
BVH3 P4241 
BVH3 A66 



III ^^JS™^^*^^*^ 
Al VZZ^™*™™^^ 

HI F^™££^^ 720 
661 P1CYTZASKDYPBVSYDGTFTVPTSXAYXHASQTX FVPFHAGDTYLRVNPQFAVPKGTEAL 720 



661 FKYTIASXDYPEVSYDGTFTVPTS 



720 
720 



661 



BVH3 WU2 
BVH3 RX1 
BVH3 JNR7/87 
BVH3 SP64 
BVH3 P4241 
BVH3 A66 



721 S^^If^ff^f^^TAONKIPVTPMANAVU,, 

Dl 
Dl 
Dl 

721 ^!™^r^^ Ei ^»^^^^^^^^ 780 



721 JSSZE^ES!^ 780 

IN 



£E5^ 780° 



721 



780 



BVH3 HU2 
BVH3 RXL 
BVH3 JNR7/87 
BVH3 SP64 
BVH3 P4241 
BVH3 A66 



fNSTSNSTL 840 

fKVEXEKLSETGNSTSNSTL 840 



Ill ^^^^^SILPQFKRNKAQENSKLDEKVEEPiCrSE] 

III ™?SS!S^ 840 

III E^5S2^ SILP ^^^ 840 

HI ££S5^ III 



BVH3 WU2 


841 


BVH3 RX1 


841 


BVH3 JNR7/87 


841 


BVH3 SP64 


841 


BVH3 P4241 


841 


BVH3 A66 


841 


BVH3 WU2 


901 


BVH3 RX1 


901 


BVH3 JNR7/87 


901 


BVH3 SP64 


901 


BVH3 P4241 


901 


BVH3 A66 


901 


i 

BVH3 WU2 


961 


BVH3 RX1 


961 


BVH3 JNR7/87 


961 


BVH3 SP64 


961 ; 


BVH3 P4241 


961 ; 


8VH3 A66 


961 J 



~assssasaEsssasr- 



EAPQGN 900 
FTGBAPQGN 900 



EEV^PVQBKVAKFABSYGMKLEN^ 900 
ff^^^ 900 



TQPTENKPADSLPEAPNEKPVKPENSTDNGMLNPEGNVGSDPM 



960 
960 
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BVHll 
BVH11 
BVHll 
BVHll 
BVHll 
BVHll 
BVHll 
BVHll 
BVHll 
BVHll 
BVHll 
BVHll 
BVHll 



2 SP64 
-2 JNR7/87 
-2 P4241 
•2 A66 
-2 WU2 
2 Rxl 
P4241 
WU2 
AS 6 
Rxl 

JNR7/87 
SP63 
SP64 



1 CSYELCRHQAGQVKKESNRVSYIDGDQAGQKAENLTPDBVSKREGINAEQIVIKITDQGY 60 

1 CSYELGRHQAGQVKKESNRVSYIDGDQAGQKAENLTPDEVSKREGINAEQIVIKITDQGY 60 

1 CSYEI/3RHQAGQDKKESNRVAYIDGDQAGQKAENLTPDEVS KREGINAEQI VI KITDQGY 60 

1 CSYEW3RHQAGQDKKESNRVAYIDGDQAGQKAENLTPDEVSKREGINAEQIVIKITDQGY 60 

1 CS YE LGRHQAGQDKKESNRVAYIDGDQAGQKABNLTPDEVSKREGIKAEQI VI KITDQGY 60 

1 CSYEIX5RHQAGQVKKESNRVSYIIX3DQAGQKAENLTPDEVSKREGINAEQI VI KITDQGY 60 

1 CS YEU3RHQAGQDKKESNR VA YIDGDQAGQKAENLTPDEVSKREG INABQI VI KITDQGY 60 

1 CS YEI/3RHQ^W3Q^KKESNR VAYI DGDQAGQKAENLT PDEVS KREG INAEQI VI KITDQGY 60 

1 CSYEI/jRHQAGQDKKESNRVAYIDGDQAG^KAENLTPDEVSKREGINAEQIVIKITDQGY 60 

1 CSYELGRHQAGQVKKESNRVSYIDGDQAGQKAENLTPDEVSKREGINAEQI VI KITDQGY 60 

1 CS YELGRHQAGQDKKESNRVAYI DGDQAGQKAENLTPDEVS KREGINAEQI VI KITDQGY 6 0 

1 CSYEU5RHQAGQVKKESNRVSYIDGDQAGQKAENLTPDEVSKREGINAEQIVIKITDQGY 60 

I CAYEIX3LHQA -QTVKENNRVS YI DG KQATQKTENLTPDEVS KREG I NAEQI V I KI TDQG Y 59 
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61 VTSHGDHYHYYNGKVPYDAI ISEELLMKDPNYQLKDSDI VNEIKGGWI KVDGKYYVYLK 120 
61 VTSHGDHYHYYNG KV P YDAI I S EELLMKD PN YQLKDSD I VNE I KGG YV I KVDG K YYVYLK 120 
61 VTSHGDHYHYYNGKVPYDAI I SBELLMKDPNYQLKDSD I VNEIKGGWI KVNGKYYVYLK 120 
61 VTSHGDHYHYYNGKVPYDAI ISBELLMKD PNYQLKDSDI VNE I KGG YVI KVNGKYYVYLK 120 
61 VTSHGDHYHYYNG ICVP YDAI I SEE LLMKDPNYQIJa)SD I VNE I KGG WIKV^ i 20 
61 VTSHGDHYHYYNGKVPYDAIISEBUWKDPNYQLKDSDIWBI 120 
61 VTSHOTHYHYYNGKVTYDAIISEELL^ 120 
61 VTSHGDHYHYYNGKVPYDAIISBELLMKDPN^^^ 120 
^SHGDHYHYYNGKVPYDAIISEELU^PNYQUaJSDIVNBIK^^ 120 
61 VTSHGDHYHYYNGKVP YDAI I SBELLMKDPNYQLKDSD IVNE IKGG YVI KVDGKYYVYLK 120 
f, 1 ^f?™ 1 * 111 ^^ 120 
61 VTSHGDHYHYYNGKVPYDAI ISBBLLMKDPNYQLKDSDIVNEI KGGYVI KVDGKYYVYLK 120 
60 VTSHGDHYHYYNGKVPYDAI ISBBLU4KDPNYQLKDSDI VNEIKGGWI KV^ 119 
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121 DAAHADNIRTKEEIKRQKQEHSHNHNSRA DNAVAAARAQGRYTTDDGYI FNASDI IE 177 

121 DAAHADNIRTKEEIKRQKQEHSHNHGGGSN - - DQAWAARAQGRYTTDDG YI FNASDI I E 178 
121 DAAHADNIRTKEEIKRQKQEHSHNHGGGSN- -DQAWAARAQGRYTTDDGYIFNASDIIE 178 
121 DAAHADNIRTICEEIKRQRQBilSHNHGGGSN- -DQAWAARAQGRYTTDDGYIFNASDIIE 178 
121 DAAHADNIRTKEEIKRQKQEHSHNHGGGSN- -DQAWAARAQGRYTTDDGYIFNASDIIE 178 

121 DAAHADNIRTKEBIKRQKQERSHNHNSRA DNAVAAARAQGRYTTDDGYI FNASDI IE 177 

121 DAAHADNIRTKEEIKRQKQEHSHNHGGGSN- -DQAWAARAQGRYTTDDGYIFNASDIIE 178 
121 DAAHADNIRTKEEIKRQKQEHSHNHGGGSN- - DQAWAARAQGRYTTDDGYIFNASDIIE 178 
121 DAAHADNIRTKEEIKRQKQEHSHNHGGGSN- -DQAWAARAQGRYTTDDGYIFNASDIIE 178 

121 DAAHADNIRTKEEIKRQKQERSHNHNSRA DNAVAAARAQGRYTTDDGYI FNASDI I E 177 

121 DAAHADNIRTKEEIKRQKQERSHNHNSRA- - -DNAVAAARAQGRYTTDDGYI FNASDI IB 177 

121 DAAHADNIRTKEEIKRQKQERSHNHNSRA DNAVAAARAQGRYTTDDGYI FNASDI I E 177 

120 DAAHADNVRTKEEI NRQKQEHSQHREGGTSANDGAVAFARSQ^RYTTDDGYI FNASDI I E 179 
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178 DTGDAYIVPHGDHYHYIPICNELSASELAAAEAYWNGKQGSRPSSSSSYNANPVQPRLSEN 237 
^DAYIVPHGDHYHYIPKNELSASE^AEAYWNGKQGSRPSSSSSYN^ 238 

Hi ^DA^VPHGNHFHYIPICSPLSASEUUUVQAYWNGKQGSRPSSSSSH^ 238 
HI P^AYIVPHGNHF1OTPKSDLSASEIAAAQAYWNGKQGSRPSSSSSHNANPAQPRI>S^ 238 
HI ^^ AY ^ PR ^^IPKSDI^ASEIAAAQAYWNGKQGSRPSSSSSHNAN 238 
HI P^]^ DAYIvphot HYHYIPKSDLSASEIJUUVQAYVWGKQGSRPSSSSSHNANPAQPRIjSE^ 237 
HI ^DAYIVPHGNHFHYIPKSDLSASEIJ^QAYWNGKQGSRPSSSSSHNANPAQPRLSEN 238 

179 DTGDAYIVPHGNHFHYI PKSDLSAS ELAAAQA YWNGKQGSRPSS S S SHNANPAQPRLS EN 238 
HI ^ DAYIVPH ^^IPKSD^ASEU^QAYWNGKQGSRPSSSSSHNANPAQPRI^EN 238 
HI ^ DAY ^ PH G D HYHYIPKSDLSASEIJU^QAYWNGKQGSRPSSSSSHNANPAQPRLSEN 237 
HI ^^ YXVPHOTHYHY IPKNEI^AS£I^^ 237 

HI ^DAYIVPHGNHFHYIPKSDLSASEIAAAQAYWNGKQGSRPSSSSSHNANPAQPRLSEN 237 

180 DTGDAYIVPHGDHYHYI PKNELSASEIAAAEAFLSGR^SNLSNLRTYRRQNSDNTPRTNWV 239 

********** *.***** .**##****#• * 
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238 HNLTVT PTYHQN QGEN I SSLLRELYAKPLSRRHVESDGL I FDPAQITS 285 

239 HNLTVT PTYHQN- QGEN I SSUJIELYAKPLSERHVESDGL I FDPAQITS 286 

239 HNLTVT PTYHQN QGENISSLLRELYAKPLSERHVESDGLI FDPAQITS 286 

239 HNLTVT PTYHQN QGENISSLLRELYAKPLSERHVESDGLIFDPAQITS 286 

23 9 HNLTVT PTYHQN QOENISSLLRELYAKPLSERRVESDGLIFDPAQITS 286 

238 HNLTVT PTYHQN QGENISSLLRELYAKPLSERHVESDGLIFDPAQITS 285 

239 HNLTVT PTYHQN QGENI SSLLRBLYAKPLSBRHVESDGLI PDPAQITS 286 

239 HNLTVT PTYHQN QGENISSLLRELYAKPLSERHVESDGLIFDPAQITS 286 

239 HNLTVT PTYHQN QGEN I SS LLRELYAKPLSERHVESDGL I FDPAQITS 286 

238 HNLTVT PTYHQN - QGENI SSLLRBLYAKPLSBRHVESDGLI PDPAQITS 285 

Ml !^™ PTYHQN " QGENISSLLRELYAKPLSERHVESDGLI FDPAQITS 285 
238 HNLTVT PTYHQN QGENI SSLLRBLYAKPLSBRHVESDGLI FDPAQITS 285 

240 PSVSNPGTTNTNTSNNSNTNSQASQSNDIDSLLKQLYKLPLSQRHVBSDGLIPDPAQITS 299 
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RTARGVAVPHGNHYHFI PYEQMSELEKRIARI I PLRYRSNHWVPDSRPEQPSPQSTPEPS 

RTARGVAVPHGNHYHFIPYEQMSELEKRIARIIPLRYRSNHWVPDSRPEQPSPQSTPEPS 
RTARGVAVPHGNHYHFI PYEQMSELEERIARI T PLH YR .QMWwvoncu Dpnoeon D0 



ITPEPS 345 
346 
PS 342 
PS 342 



»«»rnunn wr i FXfcwnsKiiEKXXARII PLRYRSNHWVPDSRPEQPSPQSTPE 

RTARGVAVPHGNHYHFI PYEQMSELEERIARI I PLRYRSNHWVPDSRPEQPSPQ 

RTARGVAVPHGNHYHFI PYEQMSELEERIARI I PLRYRSNHWVPDSRPEQPSPQ 

RTARGVAVPHGNHYHFI PYEQMSELEERIARI I PLRYRSNHWVPDSRPEQPSPQ 

RTANG VA VPHGDHYHF I PYSQLSPLEEKLARI I PLRYRSNHWVPDSRPEQPSPQSTPE 

RTARGVAVPHGNHYHFI PYEQMSELEERIARI I PLRYRSNHWVPDSRPEQPSPQ 

RTARGVAVPHGNHYHFI PYEQMSELEERIARI I PLRYRSNHWVPDSRPEQPSPQ 

RTARGVAVPHGNHYHFI PYEQMSELEERIARI I PLRYRSNHWVPDSRPEQPSPQ PS 

RTANGVAVPHGDHYHFIPYSQI^PI^EKIJUIIIPUIYRSNHWVPDSRPEQPSPQSTPEPS 
RTARGVAVPHGNHYHFI PYEQMSELEKRIARI I PLRYPSMmnronoo dot oo^n,,^ 
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2 SP64 346 
-2 JNR7/87 347 

-2 P4241 343 

-2 A66 343 

-2 WU2 343 

■2 Rxl 346 

P4241 343 

WU2 343 

A66 343 

Rxl 346 

JNR7/B7 346 

SP63 346 

SP64 360 



PSLQPAPNPQPAPSNPIDE KL VKEA VRKVGDG YVFE ENGVSR Y I PAKDLS AET AAG I D S K 405 
PS PQ PAPNPQPAPSNPI DEKLVKEAVRKVGDGYVFEENGVSRYI PAKDLSAETAAG I DS K 406 
PSPQPAPNPQPAPSNPIDEKLVKEAVRKVGDGYVFEENGVSRYIPAKDLSAETAAGIDSK 402 
PSPQPAPNPQPAPSNPIDEKLVKEAVRKVGDGYVFEENGVSRYI PAKDLSAETAAGIDSK 402 
PSPQPAPNPQPAPSNPIDEKLVKEAVRKVGDGYVFEENGVSRYIPAKDLSAETAAGIDSK 402 
PS PQPAPNPQPAPSNPIDEKLVKEAVRKVGDGYVFBENGVPRYI PAKDLSAETAAGIDSK 405 
PSPQPAPNPQPAPSNPIDBKL VKEA VRKVGDG YVFEENGVSRYI PAKDLSAETAAGIDSK 402 
PSPQPAPNPQPAPSNPIDEKL VKEA VRKVGDG YVFEENGVSRYI PAKDLSAETAAGIDSK 4 02 
PS PQPAPNPQPAPSNPIDEKLVKEAVRKVGDG YVFEENGVSRYI PAKDLSAETAAGIDSK- 402 
PSPQPAPNPQPAPSNPIDEKL VKEA\niKVGDGYVFEENGVPRYIPAKDLSAETAAGIDSK 405 
PSP QPAPSNPIDEKLVKEAVRKVGDGYVFEENGVSRYI PAKDLSAETAAGIDSK 399 
PSPQSAPNPQPAPSNPIDEKLVKEWRKVGDGYVFEKNGVSRYI PAKNLSAETAAGIDSK 405 
PSPQPAPNPQPAPSNPIDEKLVKEAVRKVGDGYVFEENGVSRYIPAKNLSABTAAGIDSK 419 
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406 liAKQESLSHKIX^UCKTDLPSSDREFYNKAYDLLARIHQDLL^ 465 

407 L^QESLSHKI/SAKKTDLPSSDREFYNKAYDLLARIHQDLLDNKGRQVDFEA 466 
403 LAKQBSLSHKIGTXlCrDLPS SDRBFYNKAYDLLARIHQDLL^ 462 
403 LAKQBSLSHKIXnTCKTDLPSSDREFYNKAYDLLARIHQDIiLDNTORQVDF 46 2 
403 I^UCQBSLSHKI/nTaCTDLPSSDREFYNKAYDLLARIHQDLIJJNKGRQV^ 462 
406 IAKQESI^KI/SAKKTDLPSSDREFYNIOVTO^ 465 
403 IJUCQBSLSHKIXJTKICTDLPSSDREFYNKAYDIjLARI HQDLLDNTO 462 
403 IJUCQESLSHKIOTKKTDLPSSDREFYNKAYDUARIHQDL^ 462 
403 LAKQESLSHKLGTKKTDLPSSDREFYNKAYDLIJUIIHQDLI^NK^ 462 
406 IAKQESLSHKI^AKKTDLPSSDREFYNKAYDLLJ^IHQDLIJDNKGRQVDFEAIiDNLLER^ 465 
400 LAKQBSIiSHKIKSAKKTDLPSSDREFYNKAYDLLARIHQDIJjDNl^ 459 
406 LAKQESLSHKI/5A1CKTDLPSSDREFTNKAYDLLARIHQDLLDNRGRQVD 46S 
420 UOCQESLSHKLGAKKTDLPSSDREFYNKAYDLIARIHQDLLJ5NKGRQVDFE^I^ 479 
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466 KDVSSDKVKLVDDILAFLAPIRHPERIjGOT S25 

467 KDVPSDKVKLVDDIIAPLAPIRHPBRIXJKPHAQITYTDDEIQVAKIJW3 526 
463 KDVSSDKVKLVEDILAFLAPIRHPEIILGKPNSQITYTDDEIQVAKIjAGK^ 522 
463 KDVSSDKVKLVEDIIAFIAPIRHPERLGKPNSQITYTDDEIQVAKIAGKYTTED^ 522 
463 KDVSSDKVKLVEDIIAFLAPIRHPEJUXSKPNSQITYTDDBIQVAKLAGK^ 522 
466 KDVSSDKVKLVDDILAfTAPIRHPERIjGKPNAQI S25 
463 KDVSSDKVKLVED ILAFLAFIRHPERLGKPNSQITYTDDEIQVAICLA^ FD P 522 
463 KDVSSDKVKLVEDILAFLAPIRHPERLGKPNSQITYTDDEIQVAKIAGKY 522 
463 KDVSSDKVKLVEDILAFLAP IRHPERLGKPNSQITYTDDEIQVAKIAGK^ PD P 522 
466 KDVSSDKVKLVDDI LAFLAPI RHPERLGKPNAQITYTDDEIQVAlOiAGKYTTEDGYI FD P 525 
460 KDVSS DKVKLVDDI LAFLAP I RH PERLGKPNAQI TYTDDB I QVAKLAG KYTTEDG YI FD P 519 
466 EDVPSDKVKLVDDI1AFIAPIRHPERI/3KPNAQITY1T3DEIQVAKIAGKYTTEDGYIFDP 525 
480 KDVSSDKVKLVDDI LAFIAP I RH PERLGKPNAQI TYTDDB I QVAKLAGK^^ 539 
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526 RDITSDEGDAYVTPHMTHSHW I KKDSLS EAERAAAQAYAKEXGLTP PSTDHQDSGNTEAK 585 

527 RDITSDEGDAYVTPHMTHSHWIKKDSLSEAERAAAOAYAKEKGLTPPSTDHQDSGNTEAK S86 
S23 RDITSDEGDAYVTPHMTHSHWIKKDSI^SEABRAAAQAYAKEKGLTPPSTDHRDSGNTEAK 582 
523 RDITSDEGDAYVTPHJfTHSHWIKKDSI^EAERAAAQAYAXEKGLTPPSTDHQDSGNTEAK 582 
S23 RD ITSDEGDAYVTPHMTHSHW I KKDSLSEAERAAAQAYAKBKGLTP PSTDHQDSGNTEAK 582 
526 RD I TS D EGD A YVTPHMTHS HW I KKDSLS EAERAAAQAYAKEKGLT P P STDHQDSGNT EAK 585 
523 RDITSDEGDAYVTPHMTHSHWIKKDSI^EAERAAAQAYAJCEKGLTPPSTDHQDSGNTEAK 582 
523 RDITSDEGDAYVTPHMTHSHW I KJCDSI^EAERAAAQAYAKEKGLTP PSTDHQDSGNTEAK S82 
523 RDITSDEGDAYVTPHhrTHSHWIKKDSLSEAERAAAQAYAiCEKGLTPPSTDHQDSG 582 
526 RDITSDEXjDAYVTPHMTHSHWI K1CDSLSEABRAAAQAYAIOSKGLTPF 585 
520 RD I TS D EGDA YVT PHMTHSHW I KKDS LS EAE RAAAQA YAKEKG LT P P STDHQDSGNTEAK 579 
526 RD I TSDEGDAYVT PHMTHSHW I KKDS LSEAERAAAQAYAKEKGLTP PSTDHQDSGNTEAK 585 
540 RDITSDEGDA YVTPHMTHS HWIKKDSLSEAEKAAAQAYAKEKGLTPPSTDHQDSGNTEAK 599 
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-2 SP64 586 GAEAI YNRVKAAKfCVPLDRMPYNI^YTVEVKNGSLI IPHYDHYHNI KFEWFDEGLYEAPK 645 
-2 JNR7/87 587 GAEAI YNRVKAAiCKVPLDRMPYNLQYTVEVKNGSLI I PHYDHYHNI KFEWFDEGLYEAPK 646 

-2 P4241 583 GAEAIYNRVKAAKKVPIjDRMPYNLQYTVEVKNGSLIIPHYDHYKNIK^ 642 

-2 A66 583 GAEAI YNRVKAAKKVPLDRMP YNLQYTVEVKNGS LI I PHYDHYHNI KFEWFDEGLYEAPK 642 

-2 WU2 583 GAEAI YNRVKAAKKVPLDRMP YNLQYTVEVKNGSLI I PHYDHYHNI KFEWFDEGLYEAPK 642 

-2 Rxl 586 GAEAI YNRVKAAKKVPLDRMP YNLQYTVBVKNGSLI I PHYDHYHNI KFEWFDEGLYEAPK 645 

P4241 583 GAEAI YNRVKAAKKVPLDRMPYNLQYTVBVKNGSLI I PHYDHYHWI KFEWFDEGLYEAPK 642 

WU2 583 GAEAI YNRVKAAKKVPLDRMPYNLQYTVEVKNGSLI IPHYDHYHNI KFEWFDEGLYEAPK 642 

A66 583 GAEAIYNRVKAAXKVPLDRM P YNLQYTVEVKNGSLI I PHYDHYHNI 642 

Rxl 586 GAEAJ YNRVKAAKKVPLDRMP YNI^YTVBVKNGSLI I PHYDHYHNI KFEWFDEGLYEAPK 64 5 

JNR7/87 580 GAEAI YNRVKAAKKVPLDRMP YNLQYTVBVKNGSLI I PHYDHYHNI KFEWFDEGLYEAPK 639 

SP63 586 GAEAI YNRVKAAKKVPLDRMPYNLQYTVEVKNGSLI I PHYDHYHNI KFEWFDEGLYEAPK 645 

SP64 600 GAEAI YNRVKAAKKVPLDRM P YNLQYTVEVKNGSLI I PHYDHYHNI KFEWFDEGLYEAPK 659 

2 SP64 646 GYSLEDIiLATVKYYVEHPNERPHSDNGFGNASDHVRKNK ADQDSK 690 

2 JNR7/87 647 GYTLBDLIATVKYYVBHPNERPHSDNGFGNASDHVRKNK VDQDSK 691 

2 P4241 643 GYTI^DLIATVKYYVBHPNERPHSDNGFGNASDHVRKNK ADQDSK 687 

2 A66 643 GYTLBDLLATVKYYVEH PNBRPHSDNGFGNASDHVRKNK ADQDSK 687 

2 WU2 643 GYTLBDLLATVKYYVBHPNERPHSDNGFGNASDHVRKNK - ADQDSK 687 

2 Rxl 646 GYSI«BDUATVKYYVEHPNERPHSDNGFGNASDHVQRNKN 705 

P4241 643 GYTLBDLLATVKYYVBHPNERPHSDNGFGNASDHVRKNK ADQDSK 687 

W2 643 GYTLED LLATVKYYVEH PNER PHSDNG FGNAS DHVRKNK ADQDSK 687 

A66 "3 GYTLBDLLATVKYYVBHPNERPHSDNGFGNASDHVRKNK ADQDSK 687 

Rxl 646 GYSLBDLIATVKYYVEHPNERPHSDNGFGNASDHVQRNK - NGQ 687 

JNR7/87 640 GYS LED LLATVKYYVEHPNER PHSDNG FGNAS DHVQRNK NGQ 681 

SP63 646 GYTLBDLLATVKYYVBHPNERPHSDNGFGNASDHVQRNK NGQ 687 

SP64 660 GYTLED LIATVKYYVEHPNER PHSDNG FGNAS DHVQRNK NGQ 701 
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^ & x- A nrtdiJC ( Aj^nA^i J WFSAUNJ J YKPSTirrEETEEEAEim , DEAEI PQ 
r^oy^nwnVSBPTHPESDBKBNHAGI^PSADNLYKPSTDTEETEEEAEDlTDEAEIPQV 
688 PDEDKGHDEVSEPTHPESDEKENHAGLNPSADNLYKPSTDTEETEEEAEDTTDEAEIPQV 74 7 
688 PDEDKGHDEVSEPTHPESDEKENHAGLNPSADNLYXPSTOTEETEKEAEDTTDEAEIPQV 747 
706 PEEDKEHDEVSEPTHPBSDEKENHVGLNPSADNLYKPSTDTEETEEEAEDTTDEAEIPOV 765 
688 PDEDKGHDEVSEPTHPESDEKENHAGLNPSAD^YKPSTDTEETEBEAEDTTDEAEIPOV 747 
688 PDEDKGHDEVSEPTHPESDEKENHAGLNPSADNLYKPSTDTEETEEEAEDTTDEAEIPQV 747 
688 PDEDKGHDEVSEPTHPESDEKENHAGLNPSADNLYKPSTDTBETEEEAEDTTDEAEIPOV 747 
688 ADTNQTEKPNE EKPQTEKPEEETPREEKPQSEKP ES PKPTEE PEEES PEES PEES EE POV 747 
682 ADTNQTEKPNEEKPQTEKPEEETPREEKPQSEKPESPKPTEEPEEESPEESPEESEEPOV 741 

688 ADTNQTEKPS EEKPQTEKPEEET PREEKPQS EKPES P KPTEE PEEESPEES EE PQ V 743 

702 ADTNQTEKPS EEKPQTEKPEEETPREEKPQSEKPESP KPTEE PEEES PEES EE PQV 757 

* * * • * * *. * . . * ** #...*.**** 

HI ^f^^^AEAIJ.EKVTDPSIRQNAMBTLTC 01o 
IA f^V^^^AEALI^fCVTDPSIRQNAMETLTGI^SI^ 811 
748 EHS VI MAKI ADAEALLBKVTDPS I RQNAMETLTGLKSSLI*IiC?rKDNNTI SAEVDSLLALIi 807 
748 BHSVINAKIADABALLEKVTDPSIRQNAMET^ 807 
1\* ^f^IADAEAI^KVTTDPSIRQNAMETLTC 807 

111 ™fX^^*^ B ^ P ^^ 807 

111 ™f^^^^^ 807 

III 807 

III ^S^EfJEESf?^ 807 

^^ KIA ^^KIQ N PIIKSNAKBTLTGI^^ eOl 
ETBKVEEKLREAEDLICKIODPI IKSNAKETLTGIJCMOT.tBYSTrinMKppTMiiT?* owt 




BVHXl 
BVH11 
BVH1X 
BVHXX 
BVHXX 
BVHXX 
BVHXl 
BVH1X 
BVH11 
BVH11 
BVH11 
BVH11 
BVH11 



-2 SP64 
-2 JNR7/87 
-2 P4241 
-2 A66 
-2 WU2 
-2 Rxl 

P4241 

WU2 

A66 

Rxl 

JNR7/87 
SP63 
SP64 



8XX KESQPAPIQ 819 
812 KESQPAPIQ 820 
808 KKSQPAPIQ 816 
808 KKSQPAPIQ 816 
808 KKSQPAPIQ 816 
826 KESQPAPIQ 834 
808 KESK 811 
808 KESK 8XX 
808 KESK 811 
808 KESK 811 
802 KESK 805 
804 KESK 807 
818 KESK 821 
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^"CCTTGT CGGQTAAQTT CCOACCCGCA CGAAAGGCOT AATGATTTGG GCACTGTCTC 
J CGaTGAAAT TTTAGTACCT GTGAAQATGC AGGTTAcSg 

n^™ CC ATGGAGCTTT ACTGCAGTTT GATATTGAGT GTCTGTACCA CATCTaSS 
tl^T* TCGGGACGCC AGTTTCGAAG GAGACGCTGT ESSES 

CCCTTGTGTT ATGGCCACTC TAACCCAGAT AGGTGATCCC TATCGQAGAC AGTGTCTGAr 
ACTGGGGCGG T ^CCTCCTA AAAGGTAACG GAGGCGCCCA £E££SJ 
GGAAATCArr CGCAGAGTOT AAAGGTATAA GGGAGCTTGA C^GCgIgS 
GCAGGGACGA AAGTOGGGCT TAGTGATCCG GTGGTTCCGT ATCGAaSS 

cggataaaag ctaccctggg gataacaggc ttatctcccc ££££££ 

ESSS 22525 ACCTCGATGT CGGCTCGTCG CATCCxSS 

ATGGTTAAAT AACCGOTTTA GAAAACTATT TAATAAAGTA AAAGAAGTTG 
TCATCArTTA TTGAAATGAG GGATTTATGA AATTTAGTAA IJiKSSS 
^ GCTGTTAT CGTATCCTTG AGTCTATGTG CCTATGCACT AA^SgS 

SSSS^ ™ TAAGGA ^aatcgt gtctcttato togatcgcS JXSXS 

SSJJS ££^r ACCAGACCAG GTTAGCCAGA AAGAAGgSJ SSSS2 
T^r^f^ TCAAAATTAC AGATCAGGGC TATGTAACGT CACACGGTGA CCACTATCAT 
GGAAAGTTCC TTATGATGCC CTCTTTAGTG AAGAACTCTT SSSSJ 
a^^n™ AACTTAAAGA CGCTGATATT GTCAATGAAG TCAAGGGTGG TrlSJSiJ 
^™ A ™ GAAAATATTA TGTCTACCTG AAAGATGCAG CTCATGcJS SSScGA 

iSSSSSS ^SSSZ nS^^ GAACATCTCA AAGaXSS 2E5S 
AATcSgS^ 2J2!^ ^TCTCAGGGA CGATATACGA CAAATGATGG TTATGTCTTT 
nZn^Z* ATATTATCGA AGATACGGGT AATGCTTATA TCGTTCCTCA TGGAGGTCAC 

S CCAAAA ° CQATTrATCT GCTAGTGAAT TAgSSSS 
SSSKi AAAATATGCA ACCGAGTCAG TTAAGCTATT CTTCAACAGC TAgSSJ 
CTGTAGCAAA AGGATCAACT AGCAAGCCAG CAAATAAATC TOaSJSS 
TGAAGGAACT CTATGATTCA CCTAGCGCCC AACGTTAGAG ^JJSSS 
TTOACCCTGC TAAGATTATC AGTCGTACAC CAAATgS toJSISSS 
£^™ G ATTACCACTT TATTCCTTAC AGCAAGCTTT CTGCTTTAGA AgSSS 
SSt^ ™ CCTATCAa tggaactggt TCTACAGTTT CTACAAATCC iSJ^JS 
2SS2^ CTAGTCTAGa CAGTCTTTCA AGCAATCCTT CTTCTTTAAC GACaSJS 
GAGCTCTCTT CAGCATCTGA TGGTTATATT TTTAATCCAA AAGATATCGT TCAA^I^ 
!S5£^ ATATTGTAAG ACATGGTGAT CATTTCCATT 
™ CGACTCTTCC AAACAATAGT CTAGCAACAC CTTCTCCATC tSJ^SJJ 
CTTCACATGA GAAACATGAA GAAGATGGAT ACGGATTtS JSStcS 
AAGATGAATC AGGTTTTGTC ATGAGTCACG GAGAcScAA TcS^mc 

TTCAAGAAGG ACTTGACAGA agagcaaatt aaggctgcgc aaaaacattt agaSaIS 

SSZ^ ^ TG0ATT TCATCTCATO JJSJSS SSS 

GCCAAAGAAA TGAAAGATTT AGATAAAAAA ATCGAAGAAA AAATTGCTGG CAWAtcJJI 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
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1200 
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1380 
1440 
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1680 
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3120 
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3300 
3360 



15/33 



SUBSTITUTE SHEET (RULE 26) 



WO 00/39299 



PCT/CA99/01218 



^™« T ° TCAAACGTGA AAQTATTOTC GTGAATAAAG AAAAAAATGC GATTATTTAT ,^,„ 

S52K! ATCACCATCA TGCAGATCCG ATTGATGAAC ATAAACCGGT TgSSJ 3?" 

GTAACTATOA ACTGTTTAAA CCCGAAGAAG GAGTTGCTAA AAAaSaISg? 

AATAAAQTTT ATACTGGAGA AGAATTAACG AATGTTGTTA ATTTGTTAAA AAATAQTACG linn 

TTTAATAATC AAAACTTTAC TCTAGCCAAT GGTCAAAAAC GCQTTTOTTT 36 °° 
CCTGAATTGQ AGAAAAAATT AGGTATCAAT aSSSJ^ S£aI££ 

AAAGTATTGO AGAAAGTATC TGGTAAAGTA TTTGGAGAAG GAGTAGGGAA JaSSSSS llll 

TTTQAATTAG ATCAACCTTA TTTACCAGGA CAAACATTTA AGTATACTAT ooeSSiif 8 ° 

gattatccag aagtaaqtta toatggtaca tttacagSc SSSS agS™ 3840 

ATGGCCAOTC AAACGATTTT CTATCCTTTC CATGCAGGGG AT^SSS JSJi^ 3900 

sss—ss li 

GCTTATTTGO ACAATCAATC GACTTATATT GTGGAAGTAC CTATrrrrrA ^nXnTTvi 

■■■■■■ 1 

SSis^SS is 

TtSJJS ^aSSJS TGACTTTGGG AGAATTTCAT 4S80 

AGTTAATT (SEQ IDNO TACAACTTAA AAAGAGGTGG AATATTTACT 5040 

5048 
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CAGAGATCTT AGTGAATCAA ATATACTTAA 
AATATCTAGC TGGGTCAQTA GCTACACTTG 
TGCATCAAGC TCAAACTGTA AAAGAAAATA 
CGACGCAAAA AACGGAGAAT TTGACTCCTG 
CCGAAGAAAT CGTCATCAAG ATTACGGATC 
ATCATTACTA TAATGGCAAG GTCCCTTATG 
AAGATCCGAA TTATCAGTTG AAGGATTCAG 
TCATTAAGGT AAACGGTAAA TACTATGTTT 
TCCGTACAAA AGAAGAAATC AATCGGCAAA 
GGACTTCAGC AAACGATGGT GCGGTAGCCT 
ATGATGGTTA TATCTTCAAT GCATCTGATA 
TTCCTCATGG AGATCATTAC CATTACATTC 
CTGCTGCAGA AGCCTTCCTA TCTGGTCGGG 
GACAAAATAG CGATAACACT CCAAGAACAA 
CTACAAATAC TAACACAAGC AACAACAGCA 
ACATTGATAG TCTCTTGAAA CAGCTCTACA 
CTGATGGCCT TATTTTCGAC CCAGCGCAAA 
TCCCTCATGG TAACCATTAC CACTTTATCC 
GAATTGCTCG TATTATTCCC CTTCGTTATC 
CAGAAGAACC AAGTCCACAA CCOACTCCAG 
ATCCTCAACC AGCTCCAAGC AATCCAATTG 
AAGTAGGCGA TGGTTATGTC TTTGAGGAGA 
ATCTTTCAGC AGAAACAGCA GCAGGCATTG 
CTCATAAGCT AGGAGCTAAG AAAACTGACC 
AGGCTTATGA CTTACTAGCA AGAATTCACC 
TTGATTTTGA GGCTTTGGAT AACCTGTTGG 
TCAAGTTAGT GGATGATATT CTTGCCTTCT 
GAAAACCAAA TGCGCAAATT ACCTACACTG 
GCAAGTACAC AACAGAAGAC GGTTATATCT 
GGGATGCCTA TGTAACTCCA CATATGACCC 
CTGAAGCTGA GAGAGCGGCA GCCCAGGCTT 
CGACAGACCA TCAGGATTCA GGAAATACTG 
GCGTGAAAGC AGCTAAGAAG GTGCCACTTG 
TAGAAGTCAA AAACGGTAGT TTAATCATAC 
TTGAGTGGTT TGACGAAGGC CTTTATGAGG 
TGGCGACTGT CAAGTACTAT GTCGAACATC 
TTGGTAACGC TAGCGACCAT GTTCAAAGAA 
CGGAAAAACC AAGCGAGGAG AAACCTCAGA 
AAGAGAAACC ACAAAGCGAG AAACCAGAGT 
AATCACCAGA GGAATCAGAA GAACCTCAGG 
GAOAGGCTOA AGATTTACTT GGAAAAATCC 
AGACTCTCAC AGGATTAAAA AATAATTTAC 
TGGCAGAAGC TGAAAAACTA TTGGCTTTAT 
TTCTAACTCC TAAAAACAGG ATAGGAGAAC 
GTTCTAG (SED ID NO : 12) 



GAAAAGAGGA AAGAATGAAA 
TTTTAAGTGT CTGTGCTTAT 



ATCGTGTTTC CTATATAGAT 
ATGAGGTTAG CAAGCGTGAA 
AAGGTTATGT GACCTCTCAT 
ATGCCATCAT CAGTGAAGAG 
ACATTGTCAA TGAAATCAAG 
ACCTTAAGGA TGCAGCTCAT 
AACAAGAACA TAGTCAGCAT 
TTGCACGTTC ACAGGGACGC 
TCATCGAAGA TACGGGCGAT 
CTAAGAATGA GTTATCAGCT 
AAAATCTGTC AAATTTAAGA 
ACTGGGTACC TTCTGTAAGC 
ACACTAACAG TCAAGCAAGT 
AACTGCCTTT GAGTCAACGC 
TCACAAGTCG AACCGCCAGA 
CTTATGAACA AATGTCTGAA 
GTTCAAACCA TTGGGTACCA 
AACCTAGTCC AAGTCCGCAA 
ATGAGAAATT GGTCAAAGAA 
ATGGAGTTTC TCGTTATATC 
ATAGCAAACT GGCCAAGCAG 
TCCCATCTAG TGATCGAGAA 
AAGATTTACT TGATAATAAA 
AACGACTCAA GGATGTCTCA 
TAGCTCCGAT TCGTCATCCA 
ATGATGAGAT TCAAGTAGCC 
TTGATCCTCG TGATATAACC 
ATAGCCACTG GATTAAAAAA 
ATGCTAAAGA GAAAGGTTTG 
AGGCAAAAGG AOCAGAAGCT 
ATCGTATGCC TTACAATCTT 
CTCATTATGA CCATTACCAT 
CACCTAAGGG GTATACTCTT 
CAAACGAACG TCCGCATTCA 
ACAAAAATGG TCAAGCTGAT 
CAGAAAAACC TGAGGAAGAA 
CTCCAAAACC AACAGAGGAA 
TCGAGACTGA AAAGGTTGAA 
AGGATCCAAT TATCAAGTCC 
TATTTGGCAC CCAGGACAAC 
TAAAGGAGAG TAAGTAAAGG 
GGGAAAACGA AAAATGAGAG 



ATCAATAAAA 
GAACTAGGTT 
GGAAAACAAG 
GGAATCAACG 
GGAGACCATT 
CTCCTCATGA 
GGTGGTTATG 
GCGGATAATG 
CGTGAAGGAG 
TACACCACAG 
GCCTATATCG 
AGCGAGTTGG 
ACCTATCGCC 
AATCCAGGAA 
CAAAGTAATG 
CATGTAGAAT 
GGTGTAGCTG 
TTGGAAAAAC 
GATTCAAGAC 
CCTGCACCAA 
GCTGTTCGAA 
CCAGCCAAGA 
GAAAGTTTAT 
TTTTACAATA 
GGTCGACAAG 
AGTGATAAAG 
GAACGTTTAG 
AAGTTGGCAG 
AGTGATGAGG 
GATAGTTTGT 
ACCCCTCCTT 
ATCTACAACC 
CAATATACTG 
AACATCAAAT 
GAGGATCTTT 
GATAATGGTT 
ACCAATCAAA 
ACCCCTCGAG 
CCAGAAGAAG 
GAAAAACTGA 
AATGCCAAAG 
AATACTATTA 
TAGCAGCATT 
CAGAATGTGA 
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420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
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GGGTCTTAAA ACTCTGAATC 
AAATCTGGAA GAAAATATGA 
TCAATAAAAA ATATCTAGCA 
AACTTGGTCG TCACCAAGCT 
ATGGTGATCA GGCTGGTCAA 
AGGGGATCAA CGCCGAACAA 
ATGGAGACCA TTATCATTAC 
AACTTCTCAT GAAAGATCCG 
AGGGTGGCTA TGTGATTAAG 
ATGCGGACAA TATTCGGACA 
ATCATAACTC AAGAGCAGAT 
CGGATGATQQ GTATATCTTC 
TCGTTCCTCA CGGCGACCAT 
TAGCTGCTGC AGAAGCCTAT 
GTTATAATGC AAATCCAGTT 
CAACTTATCA TCAAAATCAA 
AACCCTTATC AGAACGCCAT 
CAAGTCGAAC CGCCAGAGGT 
ATGAACAAAT GTCTGAATTG 
CAAACCATTG GGTACCAGAT 
CTAGTCCAAG TCTGCAACCT 
AQ AAAT TGGT CAAAGAAGCT 
GAGTTTCTCG TTATATCCCA 
GCAAACTGGC CAAGCAGGAA 
CATCTAGTGA TCGAGAATTT 
ATTTACTTGA TAATAAAGGT 
GACTCAAGGA TGTCTCAAGT 
CTCCGATTCG TCATCCAGAA 
ATGAGATTCA AGTAGCCAAG 
ATCCTCGTGA TATAACCAGT 
GCCACTGGAT TAAAAAAGAT 
CTAAAGAGAA AGGTTTGACC 
CAAAAGGAGC AGAAGCTATC 
GTATGCCTTA CAATCTTCAA 
ATTATGACCA TTACCATAAC 
CTAAGGGGTA TAOTCTTGAG 
ACGAACGTCC GCATTCAGAT 
AGGCAGACCA AGATAGTAAA 
CTCACCCTGA ATCTGATGAA 
TTTATAAACC AAGCACTGAT 
AGGCTGAAAT TCCTCAAGTA 
CCTTGCTAGA AAAAGTAACA 
GTCTAAAAAG TAGTCTTCTT 
ATAGTCTCTT GGCTTTGTTA 
(SBQ ID NO : 13) 



CTTTAGAGGC 
GTGTTCTAGC 
GGTTCAGTGG 
GGTCAGGTTA 
AAGGCAGAAA 
ATTGTTATCA 
TATAATGGCA 
AATTATCAGT 
GTAGACGGAA 
AAAGAAGAGA 
AATGCTGTTG 
AATGCATCTG 
TACCATTACA 
TGGAATGGGA 
CAACCAAGAT 
GGGGAAAACA 
GTAGAATCTG 
GTAGCTGTCC 
GAAAAACGAA 
TCAAGACCAG 
GCACCAAATC 
GTTCGAAAAG 
GCCAAGGATC 
AGTTTATCTC 
TACAATAAGG 
CGACAAGTTG 
GATAAAGTCA 
CGTTTAGGAA 
TTGGCAGGCA 
GATGAGGGGG 
AGTTTGTCTG 
CCTCCTTCGA 
TACAACCGCG 
TATACTGTAG 
ATCAAATTTG 
GATCTTTTGG 
AATGGTTTTG 
CCTGATGAAG 
AAAGAGAATC 
ACGGAAGAGA 
GAGAATTCTG 
GATCCTAGTA 
CTCGGAACGA 
AAAOAAAGTC 



AGACCCACAA 
AGAAGAATTA 
CAGTCCTTGC 
AGAAAGAGTC 
ATTTGACACC 
AGATTACGGA 
AGGTTCCTTA 
TGAAGGATTC 
AATACTATGT 
TTAAACGTCA 
CTGCAGCCAG 
ATATCATTGA 
TTCCTAAGAA 
AGCAGGGATC 
TGTCAGAGAA 
TTTCAAGCCT 
ATGGCCTTAT 
CTCATGGTAA 
TTGCTCGTAT 
AACAACCAAG 
CTCAACCAGC 
TAGGCGATGG 
TTTCAGCAGA 
ATAAGCTAGG 
CTTATGACTT 
ATTTTGAGGT 
AGTTAGTGGA 
AACCAAATGC 
AGTACACAAC 
ATGCCTATGT 
AAGCTGAGAG 
CAGACCACCA 
TGAAAGCAGC 
AAGTCAAAAA 
AGTGGTTTGA 
CGACTGTCAA 
GTAACGCTAG 
ATAAGGAACA 
ACGCTGGTTT 
CAGAGGAAGA 
TTATTAACGC 
TTAGACAAAA 
AAGATAATAA 
AACCGGCTCC 



AATGACAAGA 
AAGTGAGGAA 
CCTAAGTGTT 
TAATCGAGTT 
AGATGAAGTC 
TCAAGGTTAT 
TGATGCCATC 
AGACATTGTC 
TTACCTTAAA 
GAAGCAGGAA 
AGCCCAAGGA 
GGACACGGGT 
TGAGTTATCA 
TCGTCCTTCT 
CCACAATCTG 
TTTACGTGAA 
TTTCGACCCA 
CCATTACCAC 
TATTCCCCTT 
TCCACAATCG 
TCCAAGCAAT 
TTATGTCTTT 
AACAGCAGCA 
AGCTAAGAAA 
ACTAGCAAGA 
TTTGGATAAC 
TGATATTCTT 
GCAAATTACC 
AGAAGACGGT 
AACTCCACAT 
AGCGGCAGCC 
GGATTCAGGA 
TAAGAAGGTG 
CGGTAGTTTA 
CGAAGGCCTT 
GTACTATGTC 
TGACCATGTT 
TGATGAAGTA 
AAATCCTTCA 
AGCTGAAGAT 
TAAGATAGCA 
TGCTATGGAG 
CACTATTTCA 
TATACAGTAG 



CCTATTTAGA 
AGAATGAAAA 
TGTTCCTATG 
TCTTATATAG 
AGTAAGAGAG 
GTGACCTCTC 
ATCAGTGAAG 
AATGAAATCA 
GATGCGGCCC 
CACAGTCATA 
CGTTATACAA 
GATGCTTATA 
GCTAGCGAGT 
TCAAGTTCTA 
ACTGTCACTC 
TTGTATGCTA 
GCGCAAATCA 
TTTATCCCTT 
CGTTATCGTT 
ACTCCGGAAC 
CCAATTGATG 
GAGGAGAATG 
GGCATTGATA 
ACTGACCTCC 
ATTCACCAAG 
CTGTTGGAAC 
GCCTTCTTAG 
TACACTGATG 
TATATCTTTG 
ATGACCCATA 
CAGGCTTATG 
AATACTGAGG 
CCACTTGATC 
ATCATACCTC 
TATGAGGCAC 
GAACATCCAA 
CGTAAAAATA 
AGTGAGCCAA 
GCAGATAATC 
ACCACAGATG 
GATGCGGAGG 
ACATTGACTG 
GCAGAAGTAG 
TAAAATGAA 
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MKINKKYLAG SVAVLALSVC SYELGRHQAG QVKKESNRVS YIDGDQAGQK 
AENLTPDEVS KREGINAEQI VIKITDQGYV TSHGDHYHYY NGKVPYDAII 
SEELLMKDPN YQLKDSDIVN EIKGGYVIKV DGKYYVYLKD AAHADNIRTK 
EEIKRQKQEH SHNHNSRADN AVAAARAQGR YTTDDGYIFN ASDIIEDTGD 
^L P ™ HYIPKKELSA SELAAAEAYW NGKQGSRPSS SSSYNANPVQ 
PRLSENHNLT VTPTYHQNQG ENISSLLREL YAKPLSERHV ESDGLIFDPA 
SSSSSi t^c GNHYHF IPY£ Q MSE ^ KRIARIIPLR YRSNHWVPDS 
Srv?pI ™ Iff* 1 * PNP <> PAPSNP IDEKLVKEAV RKVGDGYVFE 
KDLSAETAAG IDSKLAKQES LSHKLGAKKT DLPSSDREFY 
NKAYDLLARI HQDLLDNKGR QVDFEVLDNL XjERLKDVSSD KVKLVDDILA 
*™™* R "t^W TDDEIQVAKL AGKYTTEDGY IFDPRDITSD 
™ SHWIKKDS LSEAERAAAQ AYAKEKGI.TP PSTDHQDSGU 
TEAKGAEAIY NRVKAAKKVP LDRMPYNIiQY TVEVKNGSLI IPHYDHYHMI 
KFEWFDEGLY EAPKGYSLED I*LATVKYYVE HPNERPHSDN GFgS^S 
™™™ S ™ DEDKEHDEVS EPTHPESDEK ENHAGLNPSA dSS 
fSf J5fI7- TDEAEIP Q VE WSVINAKIAD AEALLEKVTD PSIRQNAMET 
LTGLKSSLLL gtkdnntisa evdsllallk ESQPAPIQ „ b 
(SEQ ID NO : 14) 838 

FIGURE 17 
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TGTGCCTATQ 
TATQTGGATO 
CAGAAAGAAG 
ACGTCACACG 
AGTGAAGAAC 
GAAGTCAAGG 
GCAGCTCATG 
GTCAAAGATA 
ACGACAAATG 
TATATCGTTC 
GAATTAGCAG 
TATTCTTCAA 
GAAGAAGATG 
GTCATGAGTC 
ATTAAGGCTG 
TTGTCATCTC 
AAAATCGAAG 
GTCGTGAATA 
CCGATTGATG 
AAACCCGAAG 
ACGAATGTTG 
AATGGTCAAA 
AATATGCTAG 
GTATTTGGAG 
GGACAAACAT 
ACATTTACAG 
TTCCATGCAG 
GATGCTTTAG 
AAAGTTGGTG 
GGAAATAAAA 
ATTGTGGAAG 
CAATTTAAAA 
AAGACTAGTG 
TCAACGTTAG 
GAAAGTTATG 
TATTTACCAT 
CAAGGAAATG 
AACCAACCAA 
GTAAAACCAG 
GACCCTATGT 
TTAGAAAAAT 
GGAACGATTG 
TAGCGTAA 



CACTAAACCA 
GCAGCCAGTC 
GAATTCAGGC 
GTGATCACTA 
TCTTGATGAA 
GTGGTTATAT 
CTGATAATGT 
ATGAGAAGGT 
ATGGTTATGT 
CTCATGGAGG 
CAGCTAAAGC 
CACCTTCTCC 
GATACGGATT 
ACGGAGACCA 
CGCAAAAACA 
ATGAACAGGA 
AAAAAATTGC 
AAGAAAAAAA 
AACATAAACC 
AAGGAGTTGC 
TTAATTTGTT 
AACGCGTTTC 
TAAAATTAAT 
AAGGAGTAGG 
TTAAGTATAC 
TTCCAACCTC 
GGGATACTTA 
TCAGAGTGTT 
AAATCAAATT 
TTCCTGTAAC 
TACCTATCTT 
GGAATAAAGC 
AGAAGGTAGA 
AAGAAGTTCC 
GGATGAAGCT 
CGGGAGAAGT 
GTGAAAATAA 
CAGAAAATAA 
AAAACTCAAC 
TAGATTCAGC 
TTACAGCTAG 
AATTAAGATT 
(SEQ ID NO 



GCATCGTTCG 
AAGTCAGAAA 
TGAGCAAATT 
TCATTACTAT 
GGATCCAAAC 
CATCAAGGTC 
TCGAACTAAA 
TAACTCTAAT 
CTTTAATCCA 
TCACTATCAC 
ACATCTGGCT 
ATCTCTTCCA 
TGATGCTAAT 
CAATCATTAT 
TTTAGAGGAA 
TTATCCAAGT 
TGGCATTATG 
TGCGATTATT 
GGTTGGAATT 
TAAAAAAGAA 
AAAAAATAGT 
TTTTAGTTTT 
AACACCAGAT 
GAATATTGCA 
TATCGCTTCA 
TTTAGCTTAC 
TTTAAGAGTG 
TGATGAATTT 
ACCGATTCCG 
CTTCATGGCA 
GGAAAAAGAA 
AGAAGAAAAC 
AAAAGAAAAA 
TACAGTGGAT 
AGAAAATGTC 
CATTAAAAAG 
ACCATCTGAA 
ACCAGCAGAT 
GGATAATGGA 
ATTAGAGGAA 
TTACGGATTA 
GCCAAGTGGA 
: 15) 



CAGGAAAATA 
AGTGAAAACT 
GTAATGAAAA 
AATGGGAAAG 
TATCAACTTA 
GATGGAAAAT 
GATGAAATCA 
GTTGCTGTAG 
GCTGATATTA 
TACATTCCCA 
GGAAAAAATA 
ATCAATCCAG 
CGTATTATCG 
TTCTTCAAGA 
GTTAAAACTA 
AATGCCAAAG 
AAACAATATG 
TATCCGCATG 
GGTCATTCTC 
GGGAATAAAG 
ACGTTTAATA 
CCGCCTGAAT 
GGAAAAGTAT 
AACTTTGAAT 
AAAGATTATC 
AAAATGGCCA 
AACCCTCAAT 
CATGGAAATG 
AAATTAAACC 
AATGCTTATT 
AATCAAACTG 
TCAAAACTTG 
CTTTCTGAAA 
CCTGTACAAG 
TTGTTTAATA 
AATATGGCAG 
AATGGAAAAG 
TCTTTACCAG 
ATGTTGAATC 
GCTCCAGCAG 
GGCTTAGATA 
GAAGTGATAA 



AGGACAATAA 
TGACACCAGA 
TTACAGATCA 
TTCCTTATGA 
AAGACGCTGA 
ATTATGTCTA 
ATCGTCAAAA 
CAAGGTCTCA 
TCGAAGATAC 
AAAGCGATTT 
TGCAACCGAG 
GAACTTCACA 
CTGAAGATGA 
AGGACTTGAC 
GTCATAATGG 
AAATGAAAGA 
GTGTCAAACG 
GAGATCACCA 
ACAGTAACTA 
TTTATACTGG 
ATCAAAACTT 
TGGAGAAAAA 
TGGAGAAAGT 
TAGATCAACC 
CAGAAGTAAG 
GTCAAACGAT 
TTGCAGTGCC 
CTTATTTAGA 
AAGGAACAAC 
TGGACAATCA 
ATAAACGAAG 
ATGAAAAGGT 
CTGGGAATAG 
AAAAAGTAGC 
TGGACGGAAC 
ATTTTACAGG 
TATCTACTGG 
AGGCACCAAA 
CAGAAGGGAA 
TAGATCCTGT 
GTGTTATATT 
AAAAGAATTT 



TCGTGTCTCT 
CCAGGTTAGC 
GGGCTATGTA 
TGCCCTCTTT 
TATTGTCAAT 
CCTGAAAGAT 
ACAAGAACAT 
GGGACGATAT 
GGGTAATGCT 
ATCTGCTAGT 
TCAGTTAAGC 
TGAGAAACAT 
ATCAGGTTTT 
AGAAGAGCAA 
ATTAGATTCT 
TTTAGATAAA 
TGAAAGTATT 
TCATGCAGAT 
TGAACTGTTT 
AGAAGAATTA 
TACTCTAGCC 
ATTAGGTATC 
ATCTGGTAAA 
TTATTTACCA 
TTATGATGGT 
TTTCTATCCT 
TAAAGGAACT 
AAATAACTAT 
CAGAACGGCC 
ATCGACTTAT 
TATTCTACCA 
AGAAGAACCA 
TACTAGTAAT 
AAAATTTGCT 
AATTGAATTA 
AGAAGCACCT 
AACAGTTGAG 
CGAAAAACCT 
TGTGGGGAGT 
AGAAGAAAAA 
CAATATGGAT 
ATTGATCTCA 
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480 
540 
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660 
720 
780 
840 
900 
960 
1020 
1060 
1140 
1200 
1260 
1320 
1380 
1440 
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S =E S 5 == SSK 

ssxg fsssss isext 

SSSSl SHE JK2SK ~S SSSSS 

lssheodyps nakSSSJ SSI?™ 0 ikaaqkhle * vktshnolds 
yphgdS SJSSJS? SS^™ kqygvkr s« wnkeknaii 
townS JSSSS kpeegvakke gnkvytgeei. 

GKVLEKVSGK VFOBoSSJ SSfSIf™ PPELEKKI ^^ NMI»VKLXTPD 

HGNAYLENNY KVGEIKLPIP K^S2^» NPQFAVPKGT DALVRVFDEF 
IVEVPILEKE JSSSSSJ SEES SK*™ 
LSETGNSTSN STLEEVPTTO !^° EKVEEP KTSEKVEKEK 

ess sssE 3£ F= s 

STSE ?^F™™ APAVDPVQEK 
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FIGURE 19 



SSSSSl SSSSSES 2S55K Sf TPDQVS ~™»-oi 

EVKGGYIIKV DOKYYVYLTO SEELLMKDPN YQLKDADIVN 100 

VAVARSQGRY *35JS5 SSS v^^^" VKDN ^N 15 o 

ELAAAKAHLA GKNMQPSQLS mSSSS ™ P *<*UYH YIPKSDLSAS 200 
I^KELYDSPS AQRYSES0GL S5SS5 PANKSBN "> S 

LSALEEKIAR MVPISGTGST VStESS JSSSfS" 0 DHYHPIP * SK 300 

SSASDGYIFN PKDIVEETAT ££££££ PSSLTTSKSL 350 

TPSPSLPINP GTSHEKHEED rv3^ "YIPKSNQIQ QPTLPHNSIA 400 

KDLTEEQIKA J»2S5S sSgTS iSS™" «J 

EKIAGIMKQY GVKRESIvS S^rvff HEQDYPGNAK EMKDLDKKIE 5 00 

HSNYELFKPE EGVAKkS S !Z GD HHHADPID EHKPVOIGHS 550 
KRVSFSFPPE LEkJES v^S SfS NQNFTLANGQ 
LDQPYLPGQT FKYTIASimv or^oi™ L EKVSGKVFG EGVGNIANFE 
GDTYLRVNPQ FA^P^^ Wvlr^l™ VPTSIiAYKMA SQTIFYPFHA 

qottrtagnJ 2JEE2J SS i5fS! ncw 

RNKAQENSKL DEKVEEPKTS E KVE KE K r -SS™* D «"WK 
EKVAKFAESY GMKLEKVLPM TCNSTSNSTL EEVPTVDPVQ 

QENKPSENQK £££££ EEESH f? EVIKKNMA DFTGEAPQGN 
PEGNVGSDPM U>J5E5X SSwSeJ ESS™" ENS ™<»™ 
ELRLPSGEVI KKNLSOFTA (slt ^O^ SS^ SVIFNMD0 ™ "00 
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SfSJSSX QENKDNNRVS YVDGSQSSQK SENLTPDQVS QKBGIQAEQI so 

TSHGDHYHYY NGKVPYDALF SEELLMKDPN YOLKDADIVM , 

=s sr™* ss ^ 

™- SSSSSS SSSS 2 2 s°o° 
EES 3SSS JK55 S-SS 3= 

FIGURE 21 
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FIGURE 22 
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180 



SSS5S SIS 2S55 ITOJS0SS0K sa " IMWS 

EVKGQYIIKV rviK^vw7^ Jf,,. NGKVPYDALF SEELLMKDPN YQLKDADIVN > 

SSS£ SSS SSSSS SEEK vavah^S _„ 

YSSTASDNNT QSVAKOSTSK !72fSSSf ELAAAKAm * GKNMQPSQLS 240 

TPNGVAIPHG DHYHFIPYsk SJS^- AQRYSESDGL- VFDPAKIISR 300 

PSSLTTSKEL SSASDGYIFN JSJXSS ^t^u™ VSTNAKPNEV VSSLGSLSSN 360 

TPSPSLPIKP GTSHBKHEED JE^SS** HnPKSNQIG QPTLPNNSLA 420 

AQKHLEBVKT SHHGlSSs SEEK* ^^oTsT^ «»™™ 



EiE sSEs o™™ s ™ ssss 

esse = =i EEi iS « 
asss ssssss sssss ssE: Eke 
sees sssse ss= £E EEs ~ 

' ' 1057 
FIGURE 23 

22/33 



SUBSTITUTE SHEET (RULE 26) 



WO 00/39299 



PCT/CA99/012I8 



vtv^^ QENKDNNRVS YVDGSQSSQK SENLTPDQVS QKEGIQAEQI 

S5SSS 5SKKS SEEKS ESSE JSSSS 

' 205 
FIGURE 24 



CAYELGLHQA QTVKENNRVS YIDGKQATQK TENLTPDEVS KREPTmbpot 

2522™ tshqd «™yy ngkvpydS? SEEKS 
SSSS? S^ LKD eeinrqSS XSSSS 

NDGAVAFARS QGRYTTDDGY IFNASOIIED TGDAYIVPHG DHYHYIPKNP 
LSASELAAAE AFLSGRENLS NLRTYRRQNS DNTPRTNWVP SVsSS? 
NTSIOJSNTNS QASQSNDIDS LLKQLYkZpL sSvESDgT SSES5 

I«-««R IIPLRYRSNH SI 
SPgPTPEPSP SPQPAPNPQP APSNPIDBKL VKEAVRKVGD GYVFEENGV*! 
RYIPAKNLSA ETAAGIDSKL AKQESLSHKL GAKKTDLPSS DrIfS^YD 

sss sssss; sssss 

iSSTSS QVAKLAGKYT TEDGYIFDPR DITSDEGOAY 

S22S5 SSSS X?Sr K SESS SEES 
SSSS SSSS 52== — S 
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QDNNTIMAEA E KLL ALL KES K U^Q » "STST 0U0 "™ 

FIGURE 25 



CAYELGLHQA QTVKENNRVS YIDGKQATQK TENLTPDEVS KREGIMAEOI 
VIKITDQGYV TSHGDHYHYY NGKVPYDAII SEELLMKDPN 
EIKGGYVIKV NGKYYVYLKD AAHADKVRTK EEInS£S ShSSt^ 

SEES SEE 2S5S SSsS SEE 

s= sssss sees sr— ™ = j- 

(SEQ ID NO : 61) 334 

FIGURE 26 
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RYRSNHWVPD SRPEEPSPQP TPEPSPSPQP APNPOPAPSN PTDFjctwi** 

S5E2 2- SSSS IS^SSi 

DKVKLVDDIL SS3SSSS SSSSS SS LAGKYTTEDG 
DEGDAYVTPH MTHSHWIKKD SLSEAERAaJ SS525 

SSSSS S^, 1 YNRVKAAKKV ^«mp^S Sve^gsI 

^"VT IKFEWFDEGL YEAPKGYTLE DLLATVKYYV EHPNERPHSD 

NGFGNASDHV QRNKNGQADT NQTEKPSEEK PQTEKPEEET PREEKPOSEK 
EEESPEESEB PQVETEKVEE kSeAEdSg SSSSS 
AKETLTGLKN NLLFGTQDNN TIMABABKLL ALLKESK 
(SEQ ID NO i 62) 487 

FIGURE 27 
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AEAFLSGREN LSNLRTYRRQ NSONTPRTNW VPSVSNPGTT NTNTSNNSNT 
NSQASQSNDI DSLLKQLYKL PLSQRHVESD GLIFDPAQIT sSwAW 

EQMSELEKRI aruplryrs nvmS fSStSSZ 

SPSPQPAPNP QPAPSNPIDB KLVKEAVRKV QDQYVFEENQ VSRYIPAKNL 
P^ 0 ^ 5 " KLGAKKTOLP SSDREFYNKA ££££££ 
FEALDNLLER LKDVSSDKVK LVDDILAFLA PIRHPERLQK 
PHAQITYTDD EIQVAKLAOK YTTEDGYIFD PRDITSDEGD AYVTPHMTHS 
HWIKKDSLSE AERAAAQAYA KEKGLTPPST DHQDSGNTEA SSSSSJ 
^AKKVPLDR MPYNLQYTVE VKNGSLIIPH YDHYHNIKFE SSSJSS 
KGYTLEDLLA TVKYYVEHPN ERPHSDNOFG NASDHVQRNK NGQADTNQTE 

kpseekpqte kpeeetpree kpqsekpesp kpteepeees peSSS 

SEEK SSTSS S'SfTS, L — » GTQDNNTIMA „. 

FIGURE 28 
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SISS^ vim HNGLDSLSSH EQDYPGNAKE MKDLDKKIEE 

KIA3IMKQYG VKRESIWNK EKNAIIYPHG DHHHADPIDE HKPVGIGHSH 
SHYELFKPEB GVAKKEGNKV YTGEELTNW NU^TFNN OOTxSS 
RVSFSFPPEL EKKLGINMLV KLITPDGKVL EKVSGKVFGE SS£SS££ Inl 
DQPYLPGQTF KYTIASKDYP BVSYDGTFTV PTsZaYKmS SESSSSSZ III 
DTYLRVNPQF AVPKGTDALV RVFDEFHGNA YLe£X!SS JSSSSq 

EEE2 2K™2 "5—5?? »«52 



NKAQENSKLD EKVEEPKTSE KVEKEKLSET StSZSZ ESSE III 

KVAKFAESYG MKLENVLFNM DGTIELYLPS GEVlSSS £££££ ill 

zsssss. jsss ™ lpe n™s 

sesss sssir bk ™ LDS vifnmdotie »!» 



568 



£Z5££ S55S2F BI,MWB EQDYPGNAKE MKDLDKKIEE 50 

™= VKRESIWNK EKNAIIYPHG DHHHADPIDE HKPVGIGHSH 100 

SNYELFKPEB GVAKKEGNKV YTGEELTNW NLLKNSTFNN QNFTLANGOK «J 

RVSFSFPPEL EKKLGINMLV KLITPDGKVL EKVSGKVFGE GvSSJS 200 

DQPYLPGQTF KYTIASKDYP BVSYDGTFTV PTSLAYKMAS SSSSS 250 

DTYLRVNPQF AVPKGTDALV RVFDEFHGNA YLENNYKVGE IKLPIPkSS „ 

GTTRTAGNKI PVTFMANAYL DNQSTYIVE (SEQ S NO f ° JJJ 

FIGURE 30 
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EVPIU3KENQ TDKPSILPOF KRNKAQENSK LDEKVEEPKT SEKVEIfPirt e 

eessz S£r pv qekvakfaes « ssss: 

nf?^ ADFTGEAPOG NGENKPSENG KVSTOTVENQ PTENKPADSL 

SSSES nlZrT™ L NPEGNV ^DP MLDPALEEAP 
S?S?2 IELRLPSGEV I««*DFIA 



FIGURE 31 
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"SIS SSS^ f"™MO ITSRTAROVA VPHGNHYHFI 50 

s£E EE =x »— ssk 

===== 

LKESK (SEQ ID »oT 6 ^ LFGTQDNNTI MAEAEKLLAL 550 

FIGURE 32 555 



E22£ "SSSS5 ST"- -™* ^ = - 

FIGURE 33 



Ess 25X2 s=ss ass j: 

EEPQVETEKV EEKLREAEDIi L (SEQ ID HO t 69) EPEEESPEES 10 0 

FIGURE 34 



USEES 2SSSS £££££ SET - - PQSEKPESPK 

TGLKNNLLFG TQDMNTIMAE AEKLLALLKE ™ "KSKAKETI. 

(SEQ ID NO : 70) "2 

i FIGURE 35 
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(SEQ ID NO : 72) 

FIGURE 37 



FIGURE 38 



FIGURE 39 
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SK5E SSSS JS5~JSS? ~ 

FIGURE 36 226 

££££ SKEf ss rtr 0 * 1 ™ «""»» 

=S2 255£ Si —»* » 

DHV (SEQ ID N^,, mUAWK YYVEHPNERP HSDNQFGNAS 200 



203 



JSSSS SSEES? J™*"™ SKREQIMAEQ 
NEIKGGYVIK ^irvv,™ 5 ™ GKVPYDAI "EELUIKDP NYQLKDSDIV 

S?SSSf S^?p^ keeikrqkqe HSHNHNSkS 

ASEIAAAEAY MraKrtf<££? oef DAYIVPHGDH YHYIPKNELS 

SSSS^ SpS2 JSS^ TVTPTYHQKQ 

fipyeqmsel ekS?iS SSS£2£ aoitsrta *° vavphonhyh 

AP N p Q PAP SN JSSSS S PSPQS TPEPS ^P 

OIDSKLAKQB SI*h£££ S2S5S SEES EX"*"** 

RQVDFEVLDN LLERLKDVQ q XZZl^ZZf^ YNKAYDLLAR IHQDLLDNKO 
YTDDEIQVAK SS5SE ^ IRHPE ^KPNAQIT 

~ ~ SEE SEE 

ssss EES i = = asss 

ENSVINAIfTA ^f^ PS ADNLYKPSTD TEETEEEAEO TTDEAEIPOV 
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819 



50 



SSSSSZ ESDG " F °™ QITSRTARGV AVPHGNHYHF 

SSSSS 2SS2S JSSSSL SKJ 0 " W8 ^SS 

IDSKLAKQES LSHKLQAKKT Ef^e™** ENGVSRYIPA KDLSAETAAO 150 

sees i~ =E — — sssss js 

SEES dEE E= sssss SSSS3SK JS 

ss==s S 

EVDSLLAIiLK E^^I^Hf: GTK ™™A 
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n^^2° S KPDEDKEHDE VSEPTHPESD EfCENHAGLNP SADNLYKPST so 
SSSS^ "V™**** ADAEALLEKV TDPSIrSSJ 1 

<£FJF£ SAEVD8UAL LKESQPAPIQ J™ 

FIGURE 40 



QACTTGACAG AAQAQCAAAT TAAGGCTGCG CAAAAACATT TAGAGGAAGT 
TAAAACTAGT CATAATGGAT TAGATTCTTT GTCATCTCAT 2E5£I£ 
ssssrrnlrn TGCCAAAGAA ATGAAAGATT TAGATAAAAA A^SaS 
AAAATTGCTG GCATTATGAA ACAATATGGT GTCAAACGTO AAAGTATTGT 
COTQAATAAA GAAAAAAATG CGATTATTTA TCCGCATGGA GAxScSc 
ATGCAGATCC GATTGATGAA CATAAACCGG TTGGAATTGG 
ZZSSSEH f CTQTTTAA ACCCGAAGAA GGAGTTGCTA 

TATACTGGA ° AAGAATTAAC GAATGTTGTT AATTTGTTAA 
AAAATAGTAC GTTTAATAAT CAAAACTTTA CTCTAGCCAA TGGTCAAAAA 
CGCGTTTCTT TTAGTTTTCC GCCTGAATTG GAGAAAAAAT SSS^ 
XATGCTAOTA AAATTAATAA CACCAGATGO AAAAGTATTG ££££££ 
CTOOTAAAGT ATTTGGAGAA GGAGTAGGGA ATATTGCAAA CTTTGAATtI 
GATCAACCTT ATTTACCAGG ACAAACATTT AAGTATACTA TCGCTTCAAA 
AGATTATCCA GAAGTAAGTT ATGATGGTAC ATTtSS SSSSJi 
TAGCTTACAA AATGGCCAGT CAAACGATTT TCTATOCTTT cSJSSS 
GATACTTATT TAAGAGTGAA CCCTCAATTT GCAGTGCCTA SSSaS 
TGCTTTAGTC AGAGTGTTTG ATGAATTTCA TGGAAATGCT SS^SS 
AaTTaGTGAA ATCAAATTAC COATTcSS IJSSSaI 
GGAACAACCA GAACGGCCGG AAATAAAATT CCTGTAACCT TCATGGCAAA 
TGCTTATTTG GACAATCAAT CGACTTATAT TGTGGAAGTA CCTATCTTGG 
AAAAAGAAAA TCAAACTGAT AAACCAAGTA TTCTACCACA aSSSSg 
AATAAAGCAC AAGAAAACTC AAAACTTGAT GAAAAGOTAG AAGAACCAAA 
252SS AAGGTAGAAA AAGAAAAACT TTCTGAAACT £££££ 
CTAGTAATTC AACGTTAGAA GAAGTTCCTA CAGTGGATCC TGTACAAGAA 
AAAGTAGCAA AATTTGCTGA AAGTTATGGG ATGAAOCTAG AAAATGTCTT 
GTJTAATATG GACGGAACAA TTGAATTATA TtSSSS SS 
TTAAAAAGAA TATGGCAGAT TTTACAGGAG AAGCACCTCA AGGAAATGGT 
GAAAATAAAC CATCTGAAAA TGGAAAAGTA TCTACTGGAA S^SISI 
SiSSSS GAAAATAAAC CAGCAGATTC TTTACCAGAG GCACCAAACG 
AAAAACCTGT AAAACCAGAA AACTCAACGO ATAATGGAAT GTTGAATCCA 
GAAGGGAATO TGGGGAGTGA CCCTATGTTA GATCCAGCAT TAGAGGAAGC 
QATCCTGTAC AAGAAAAATT AGAAAAATTT ACAGCTAGTT 
ACG^TTAGO CTTAGATAGT GTTATATTCA ATATGGATGO AACGATTGAA 
TTAAGATTGC CAAGTGGAGA AGTGATAAAA AAGAATTTAT CTGAtScaT 
AGCGAAGCTT CGTTATCGTT CAAACCATTG GGTACCAGAT tSJSSSg 
AAGAACCAAG TCCACAACCG ACTCCAGAAC CTAGTCCAAG TCcSSSS? 
GCACCAAATC CTCAACCAGC TCCAAGCAAT CCAATTGATG AGAAATTGGT 
CAAAGAAGCT GTTCGAAAAO TAGGCGATGG TTATGTCTTT ££SaS£ 
GAGTTTCTCG TTATATCCCA GCCAAGAATC TTTCAGCAGA AACAGCAGCA 
in^TlsnjIn OCAAACTGGC CAAGCAGGAA AGTTTATCTC ££££££ 
AGCTAAGAAA ACTGACCTCC CATCTAGTGA TCGAGAATTT TACAATAAGG 
CTTATGACTT ACTAGCAAGA ATTCACCAAG ATTTACTTGA TaSaA^S 
CGACAAGTTG ATTTTGAGGC TTTGGATAAC CTGTTGGAAC GACTCAAGgI 
OATAAAGTCA AGTTAGTGGA TGATATTCTT SSSiSS 
CTCCGATTCG TCATCCAGAA CGTTTAGGAA AACCAAATGC GCAAATTACC 
TACACTGATG ATGAGATTCA AGTAGCCAAG TTGGCAGGCA AGTaSSJc 
iSSSf^I TATATCTTTG ATCCTCGTGA TATAACCAGT G^TGAGGGGQ 

ATGCCTATGT aactccacat atgacccata gccactogat taaaaaagat 
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^ZZ°J^ a AAQCTGAGAQ AGCGGCAGCC CAGGCTTATO CTAAAGAQAA 
n^TT™^ CCTCC TTCGA CAGACCATCA GGATTCAGGA AATACTGAGG 
r£^™ C AQAAGCTATC TACAACCGCG TGAAAGCAGC TAAGAAGGTG 
SSSE? QTATGCCTTA CAATCTTCAA TATACTGTAG AAGT^aS 
SK^ST* ATCATAC CTC ATTATGACCA TTACCATAAC ATCAAATTTG 
?S™ A CGAAG0CCTT TATGAGOCAC CTAAGGGGTA XSJJJSE 
CGACTGTCAA GTACTATGTC GAACATCCAA AcSSSk 
aSSES** AATGGTTTTG GTAACGCTAG CGACCATGTT cSSSSS 
AAAATGGTCA AGCTGATACC AATCAAACGG AAAAACCAAG CGAGCATAA& 
^™^ Q AAAAACCTGA GGAAGAAACC SS£SE 

CCAGAGTCTC CAAAACCAAC AGAGGAACCA SSSSSS 
liBrrotn^ ATCAGAAGAA CCTCAGGTCG AGACTGAAAA SSS 
JUSXSS AGQCTQAA0A TTTACTTOGA AAAATCCAGG AtcSJJSS 

ss?s== 
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FIGURE 4X 



EAYWNGKQGS RPSSSSS YNA NPVQPRLSEN HNLTVTPTYH n M nPtr MT cnr 

sees sees ss =S - 

PSNPIDEKLV KEAVRKVQDG YVPPPMPuen LQPAPNPQPA 15 0 

FIGURE 42 
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CAYALNQHRS QENKDNNRVS YVDGSQSSQK SENLTPDQVS OKEGIOAEOI 

ZSSSSSSZ ZSS^SZ™ 0 ™ SEELLMKDPN ES 
VAvSsoS AA " ADNVR ™ DEIMRQKQEH VKDNEKVNSN 

e2S C? IIEDTGNA "VPHGGHYH YIPKSDLSAS 

LUffiJJSSi A^v2f S ° LS YSSTA SDNNT QSVAKGSTSK PANKSENLQS 
AQRYSESDQL VFDPAKIISR TPNGVAIPHG DHYHFIPYSK 

Z^tl VS ™«*™ VSSWSLsS SSSSS 
SSSSS** AyiVRHGD » F HYIPKSNQIG QPTLPNNSLA 

iEi EE °~ == 
EE EE s= sssss 
ssS ~ MS ss 



RYIPaJSt f ! PQP ^SNPIDEKL VKEAVRKVGD GYVFEENGVS 

f^f^ * ETAAGID SKL AKQESLSHKL GAKKTDLPSS DREFYMKAYD 

SSS^f dvssdS SSSS 

5?PHM™«HW rS 0061 QVAKLAGKYT TEDGYIFDPR DITSDEGDAY 

SPSS- 



FIGURE 44 
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gSaS^ 2SSS2 £5£S SSI GCCA TACTCATATT 
TATATTGATQ AcSSS^ JSSXS arr^r^ TCAGATTGCC 
ATCAGTQCTO AAGAAGGCA? ^gSSa^ CAAACAAAAC GATGGATCAA 

TATGTGACCT CAC^S S£££ toSSS TGACCAAGGC 
ATTATTAGTG AAGAGTTGTT GATGACGGAT ffSff 0 ™* ^ATGATGCG 

ATCAATGAAA TCTTAGArrn ^™ CCTAATTACC GTTTTAAACA ATCAGACGTT 
AAGCCAGSS SESSSX X£^2 GCAACTATTA TGTTTACCTC 

AAAGGAACTA AAGAAGCTAA ISSSS iJSJISJS AAATTGCTGA GCAAGTAGCC 
GAAGTTGCGG CAGTCAATOA £S^a^ J22Sf TGGCCCATCT CAGTAAAGAA 
ATTTTTAGTC CGACAGaSJ SS^J ^^ CGCT ATACTACAGA CGATGGCTAT 
AATCACTATC ATTATATTCC SZSS^l ^° AGATG ^ATTTAGT ACCTCATGGT 
GCCTACTGGA GtS£££ I^ahS ESS™** GTGAGCTA GC TGCTGCACAA 
GCCCCAGGTC GTAGGAAAGC SE5^ GCTAGACCGT CTGATTACCG CCCGACACCA 
CAGCCAGATA aSSSS^A SSS £SS^ CTAACCCTGG ACAAGGTcS 
AAACACCAAA GAGATGAGTT SKSE SSS?^ CAAATGATGC OTCACAAAAC 
CGTCTTGATT TGAAATACCG £55^ TCAACTACAC 

GTGATCAAAT CAAACGCTTT Trnnr^^ ^AGATGGGT TGATTTTTGA ACCGACTCAA 
CCAAGAAGTC JSSS2E GAGATCATTA TCATATTATC 

ACTGAGOACA ATGACTCAGG A ? CGATACCT AGCTGGCCAA 

ACCTTTCTTG GTCATCGCAT CAAAGCTTAC <* GATAA AGA AGTGACACAT 

ACGAGTGATG CTTATGTTTT S££S^ T^^Z TAGATGGTAA ACCATATGAT 
ACAGCTAAAC ACGGAGATCA XSSISK 2^™~ CAGTGGATAA ATCAGGAGTT 
TTGGATGAGG TCGCTAACTG ^2S£ » 1™***° ^AGAACTTGA ACAATATOAG 
TTGGATCAGG JUuSSES AgIa2£c£ £2™°™ TOCTGG ™CT 

GTAACAAAAG ATGGTAAAGT GGGCT^ATO 5S^f£ CTOAAAAAGT GAGTCGCAAA 
GCTCGTGATC AACTTGATTT OAC^SS g^,^ ATGGTAAGGA CTATTTCTAT 
GATAAGAAGC ATTaSgJS SS£5££ SS! AATGCTTAAA 
GATGTGTCAA GTCTGCCGAT GCaSSS TTGAGCCACG ACTTGCTGTA 

gttatcccac atattgatca tScpaS ^ gctactt acgatactgg aagttcgttt 

ATTGCAACAG tSaSS ^GTATT CATGGTTGAC GCGCGATCAG 

CCAGGGCATG AAGAOTCaS SSSJSJ ^^5° GTCCQGA TGT ATGGTCTAAG 
GGTATGCCAA ACTgS^ SS£55S S3SE2 CGCCTCTTGA TAAACGTGCT 
GGTCGTTTTG CAACACCAGA rnn^^ZZ GCTGA AGAAG TTCAAAAAGC CCTAGCAGAA 
ACTTTTGTAT £ESS£ S££52 «««^ GGCCAAAgJJ 

ACCATTAATA AATCTGATCT ^C^SS SSS^ ^^CAG TTCATTGAGA 

aagaaaaata ctggtgaSJ 5S2E5 aagctcaaga gttattggca 

AAGAGCAATO AAAACCAACA Q^SSSS nnl^^ AAGAAAAGCA ACAGGCAGAT 

tttatagaca GrrrAccJS SSSSSJ aagaaaaaga atcagatgac 

CAATTAGCAC AAAAAGCTAA TA^SmS CCCTAGAAGA TCATATCAAT 

CAATTTTATA ATAAAAATgS S^SS TTTTCCAACC AGAAGGTGTC 

AACCCTTAA (MoY™ NoTtJT ACTTATGATA TCAAGACACT TCAACAAATA 
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™ YQYIGS VAAILLA ™* GSYQLGKHHM GLATKDNQIA YIDDSKGKAK 
APKTNKTMDQ ISAEEGISAE QIWKITDQO YVTSHGDHYH FYNGKVPYDA 
i^rn™ PNYRFK Q^V INEIhDOYVI KVNGNYYVYL KPGSKRKNIR 
TKQQIAEQVA KGTKEAKEKG LAQVAHLS KE EVAAVNEAKR QGRYTTDDGY 
IFSPTDIIDD LGDAYLVPHG NHYHYIPKKD LSPSELAAAQ AYWSQKOGRG 
APGRRKAPIP DVTPNPGQGH QPDNGGYH^ iJSSSS 
KHQRDEPKGK TFKELLDQLH RLDLKYRHVE EDGLIFEPTQ ViSnaFGYV 
VPHGDHYHII PRSQLSPLEM ELADRYLAGQ TEDNDSGSEH SUSSES 
iSSfiSJ GKGLDGKPYD TSDAYVFSKE SIHSVdSS! A« 
IGFGELEQYE LDEVANWVKA KGQADELAAA LDQEQGKEKP LFDTKKVSRK 
VTKDGKVGYM MPKDGKDYFY ARDQLDLTQI AFAEQELMLK XSSS" 
DTGIEPRLAV DVSSLPMHAG NATYDTGSSF VIPHIDHIHV 
IATVKYVMQH PEVRPDVWSK PGHEESGSVI PNVTPLDKRA GMpSSS 
AEEVQKALAE GRFATPDGYI FDPRDVLAKE TFVWKDGSFS So S 
TINKSDLSQA EWQQAQELIA KKNTGDATDT DKPKEKQQAD SS 
ASKEEKESDO FIDSLPDYGXi DRATLEDHIN QLAQKANIDP 55K2J 
QFYNKNGELV TYDIKTLQQI NPP (SEQ ID NO • 81) 



50 
100 
ISO 
200 
250 
300 
350 
400 
450 
500 
550 
600 
650 
700 
750 
800 
823 



FIGURE 46 



31/33 



SUBSTITUTE SHEET (RULE 26) 



WO 00/39299 



PCT/CA99/01218 



QTOAAGAAAA 
GGAAGTTACC 
TATATTGATQ 
ATCAGTGCTG 
TATGTGACCT 
ATTATTAGTG 
ATCAATGAAA 
AAGCCAGGTA 
AAAGGAACTA 
GAAGTTGCGG 
ATTTTTAGTC 
AATCACTATC 
GCTTACTGGA 
GCCCCAGGTC 
CAGCCAGATA 
AAAGACCAAA 
CGTCTTGATT 
GTGATCAAAT 
CCAAGAAGTC 
ACTGAGGACA 
ACCTTTCTTG 
ACGAGTGATG 
ACAGCTAAAC 
TTGGATGAGG 
TTGGATCAGG 
GTAACAAAAG 
GCTCGTOATC 
GATAAGAACC 
GATGTGTCAA 
GTTATCCCTC 
ATTGCAACAA 
CCAGGGCATG 
GGTATGCCAA 
GGTCGTTTTG 
ACTTTTGTAT 
ACCATTAATA 
AAGAAAAACG 
AAGAGCAATG 
GACTTTATAG 
AATCAATTAG 
GTCCAATTTT 
ATAAACCCTT 



CATATGGTTA TATCGGCTCA 
AACTTGGTAA GCATCATATG 
ATAGCAAAGG TAAGGCAAAA 
AAGAAGGCAT CTCTGCTGAA 
CACACGGTGA CCATTATCAT 
A AGA GTTGTT GATGACGGAT 
TCTTAGACGG TTACGTTATT 
GTAAGCGCAA AAACATTCGA 
AAGAAGCTAA AGAAAAAGGT 
CAGTCAATGA AGCAAAAAGA 
CGACAGATAT CATTGATGAT 
ATTATATTCC TAAAAAAGAT 
GTCAAAAACA AGGTCGAGGT 
GTAGGAAAGC TCCAATTCCT 
ACGGTGGCTA TCATCCAGCG 
GAGATGAGTT TAAAGGAAAA 
TGAAATACCG TCATGTGGAA 
CAAACGCTTT TGGGTATGTG 
AGTTATCACC TCTTGAAATG 
ATGATTCAGG TTCAGATCAC 
GTCATCGCAT CAAAGCTTAC 
CTTATGTTTT TAGTAAAGAA 
ACGGAGATCA TTTCCACTAT 
TCGCTAACTG GGTGAAAGCA 
AACAAGGCAA AGAAAAACCA 
ATGGTAAAGT GGGCTATATT 
AACTTGATTT GACTCAGATT 
ATTACCGTTA TGACATTGTT 
GTCTGCCGAT GCATGCTGGT 
ATATTGATCA TATCCATGTC 
TCAAGTATGT GATGCAACAC 
AAGAGTCAGG TTCGGTCATT 
ATTGGCAAAT CATCCATTCT 
CAACACCAGA CGGCTATATT 
GGAAAGATGG CTCCTTTAGC 
AATCTGATCT ATCCCAAGCT 
CTGGTGATGC TACTGATACG 
AAAACCAACA GCCAAGTGAA 
ACAGTTTACC AGACTATGGT 
CACAAAAAGC TAATATCGAT 
ATAATAAAAA TGGTGAATTA 
AA (SEQ ID NO : 82) 



GTTGCTGCCA 
GGTCTAGCAA 
GCCCCTAAAA 
CAGATCGTAG 
TTTTACAATG 
CCTAATTACC 
AAAGTCAATG 
ACCAAACAAC 
TTAGCTCAAG 
CAAGGACGCT 
TTAGGAGACG 
TTGTCTCCAA 
GCTAGACCGT 
GATGTGACGC 
CCTCCTAGGC 
ACCTTTAAGG 
GAAGATGGGT 
GTGCCTCATG 
GAATTAGCAG 
TCAAAACCAT 
GGAAAAGGCT 
TCCATTCATT 
ATAGGATTTG 
AAAGGTCAAG 
CTCTTTGACA 
ATGCCAAAAG 
GCCTTTGCCG 
GACACAGGTA 
AATGCTACTT 
GTTCCGTATT 
CCCGAAGTTC 
CCAAATGTTA 
GCTGAAGAAG 
TTCGATCCAC 
ATCCCAAGAG 
GAGTGGCAAC 
GATAAACCCA 
GCCAGTAAAG 
CTAGATAGAG 
CCTAAGTATC 
GTAACTTATG 



TTTTACTAGC 
CAAAGGACAA 
CAAACAAAAC 
TCAAAATTAC 
GGAAAGTTCC 
ATTTTAAACA 
GCAACTATTA 
AAATTGCTGA 
TGGCCCATCT 
ATACTACAGA 
CTTATTTAGT 
GTGAGCTAGC 
CTGATTACCG 
CTAACCCTGG 
CAAATGATGC 
AACTTTTAGA 
TGATTTTTGA 
GAGATCATTA 
ATCGATACTT 
CAGATAAAGA 
TAGATGGTAA 
CAGTGGATAA 
GAGAACTTGA 
CTGATGAGCT 
CTAAAAAAGT 
ATGGCAAGGA 
AACAAGAACT 
TTGAGCCACG 
ACGATACTGG 
CATGGTTGAC 
GTCCAGATGT 
CGCCTCTTGA 
TTCAAAAAGC 
GAGATGTTTT 
CAGATGGCAG 
AAGCTCAAGA 
AAGAAAAGCA 
AAGAAGAAAA 
CAACCCTAGA 
TCATTTTCCA 
ATATCAAGAC 



TACTCATATT 
TCAGATTGCC 
GATGGATCAA 
TGACCAAGGT 
TTATGATGCG 
ATCAGACGTT 
TGTTTACCTC 
GCAAGTAGCC 
CAGTAAAGAA 
CGATGGCTAT 
ACCTCATGGT 
TGCTGCACAA 
CCCGACACCA 
ACAAGGTCAT 
GTCACAAAAC 
TCAACTACAC 
ACCGACTCAA 
TCATATTATC 
AGCCGGTCAA 
AGTGACACAT 
ACCATATGAT 
ATCAGGAGTT 
ACAATATGAG 
TGCTGCTGCT 
GAGTCGCAAA 
CTATTTCTAT 
AATGCTTAAA 
ACTTGCTGTA 
AAGTTCGTTT 
GCGCGATCAG 
ATGGTCTAAG 
TAAACGTGCT 
CCTAGCAGAA 
GGCCAAAGAA 
TTCATTGAGA 
GTTATTGGCA 
ACAGGCAGAT 
AGAATCAGAT 
AGATCATATC 
ACCAGAAGGT 
GCTTCAACAA 
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™£E!5 GSYQLOKHHM GLATKDNQIA YIDDSKGKAK 
ISAEEGISAE QIWKITDQG YVTSHGDHYH FYNGKVPYDA 
IISEELLMTD PNYHFKQSDV INEILDGYVI KVNGNYYVYL KPGSKRKMTP 

SSS52 wi«iLs« ESSES 

IFSPTDIIDD LGDAYLVPHG NHYHYIPKKD LSPSEUVAAO AYW<?nKmrpr 

SSSSS: DVTPNPGQGH QPoKSJS %E2SS£ 

PrS, «?™ RLD1.KYRHVE EDGLIFEPTQ VIKSNAFGYV 
SSJ SSfSi™ ^ R5fLAGQ TEDNDSGSDH SKPSOKEVTH 
"ir„f: GKGLDGKPYD TSDAYVFSKE SIHSVDKSGV TAKHGDHFHY 
SSSSSS 0 " LDEVANWVKA KGQADELAAA LDQEQGKEKP LfSESSS 

£££££ MDQLnw « i afLSlhS JSSSXJ 

DVSSLPMHAG NATYDTGSSF VIPHIDHIHV VPYSWLTPDO 

ssssssi jess™ m « ees ° s ^ pnvtpS sssss 

AEEVQKALAE GRFATPDGYI FDPRDVLAKE TFVWKDGSFS TODnnnee/D 

SKffiK !ao,AOMIDT "SSS 
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